<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-GB">
	<id>https://t2bwiki.iihe.ac.be/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Romain+Rougny</id>
	<title>T2B Wiki - User contributions [en-gb]</title>
	<link rel="self" type="application/atom+xml" href="https://t2bwiki.iihe.ac.be/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Romain+Rougny"/>
	<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/Special:Contributions/Romain_Rougny"/>
	<updated>2026-05-31T00:11:14Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.45.3</generator>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=748</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=748"/>
		<updated>2016-11-04T15:58:45Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Job Deletion */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines:&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into these machines.&lt;br /&gt;
::::Servers : mshort [ m0 , m1 , m2 , m3 ] , mlong [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system: it manages all the submitted jobs and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@mshort.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option makes ssh send a keep-alive message to the server after every 100 seconds without traffic, so you should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
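&lt;br /&gt;
To avoid retyping these options, they can also be stored on your own computer. Below is a minimal sketch of an entry for the OpenSSH client configuration file &#039;&#039;~/.ssh/config&#039;&#039;, assuming OpenSSH is your ssh client. The alias &#039;&#039;t2b&#039;&#039; is a hypothetical name, and &#039;&#039;username&#039;&#039; must be replaced by your own login:&lt;br /&gt;
&amp;lt;pre&amp;gt;# Hypothetical alias: pick any name you like&lt;br /&gt;
Host t2b&lt;br /&gt;
    HostName mshort.iihe.ac.be&lt;br /&gt;
    User username&lt;br /&gt;
    # Same effect as -X on the command line&lt;br /&gt;
    ForwardX11 yes&lt;br /&gt;
    # Same effect as -o ServerAliveInterval=100&lt;br /&gt;
    ServerAliveInterval 100&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
With such an entry, &#039;&#039;ssh t2b&#039;&#039; should be equivalent to the full command above.&lt;br /&gt;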
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should look first for information&lt;br /&gt;
* The email address for cluster support (please use it rather than personal mail, so that everyone on the support team can answer and track the progress)&lt;br /&gt;
* The CPU time limit imposed per process, which depends on which of our 2 groups of UIs you are on:&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;mshort.iihe.ac.be :  m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be &amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;mlong.iihe.ac.be : m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under heavy load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data; it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do!&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take note of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user, but for work that you need to share or produce as a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highbw&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid, subgroup of highmem, with WNs having 10Gb/s bandwidth access to storage.&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3512&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1984&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 448&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 2500 jobs &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;  (see [[bigSubmission|here]] if you want to send more) &amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set the mail address (use in conjunction with -m); it MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
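&lt;br /&gt;
The options above can also be written at the top of the job script as &#039;&#039;#PBS&#039;&#039; directive lines, which Torque&#039;s qsub reads at submission time. The following is only a sketch of such a header, assuming a bash job script; the job name &#039;&#039;mytest&#039;&#039; and the one-hour walltime request are illustrative assumptions, not site defaults:&lt;br /&gt;
&amp;lt;pre&amp;gt;#!/bin/bash&lt;br /&gt;
# Queue to use (mandatory)&lt;br /&gt;
#PBS -q localgrid&lt;br /&gt;
# Name of the job (hypothetical)&lt;br /&gt;
#PBS -N mytest&lt;br /&gt;
# Request 2 cores on 1 node&lt;br /&gt;
#PBS -l nodes=1:ppn=2&lt;br /&gt;
# Request 1 hour of walltime (illustrative value)&lt;br /&gt;
#PBS -l walltime=01:00:00&lt;br /&gt;
# ... the commands of your job follow here&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
A script with such a header can then be submitted with a plain &#039;&#039;qsub myjob.sh&#039;&#039;. Options given on the command line should take precedence over the directives in the script.&lt;br /&gt;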
&lt;br /&gt;
&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|25px|line=1|]] If you want to send more than 2500 jobs to the cluster, write all qsub commands in a text file, and use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; (more info [[bigSubmission|here]]).&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show the distribution of jobs across the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script is provided to analyze the logs and print some statistics about your jobs:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
=== Job Deletion ===&lt;br /&gt;
Use the following command:&lt;br /&gt;
 qdel &amp;lt;JOBID&amp;gt;&lt;br /&gt;
To delete all your jobs, use the following line (it can take a while, so please be patient):&lt;br /&gt;
 for j in $(qselect -u $USER);do timeout 3 qdel -a $j;done&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
There are several areas that we regularly back up: &#039;&#039;&#039;/user&#039;&#039;&#039; , &#039;&#039;&#039;/group&#039;&#039;&#039; , &#039;&#039;&#039;/data&#039;&#039;&#039; , &#039;&#039;&#039;/ice3&#039;&#039;&#039;.&amp;lt;br&amp;gt;&lt;br /&gt;
You can find more information on the backup frequency and how to access them [[Backup|here]].&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=747</id>
		<title>Getting a certificate for the T2</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=747"/>
		<updated>2016-11-04T12:46:28Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* First time certificate */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;If you need grid access on the T2, please follow all the steps below: &amp;lt;br&amp;gt; &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== First time certificate ===&lt;br /&gt;
# [[Obtaining_a_certificate | Get a certificate (new)]]. &#039;&#039;If you already have a CERN grid certificate, you can use it temporarily, but for accounting reasons we need you to get a Belgian certificate.&#039;&#039;&lt;br /&gt;
# Extract your certificate from your browser: &#039;&#039;&#039;[[Chrome]]&#039;&#039;&#039; | &#039;&#039;&#039;[[Firefox]]&#039;&#039;&#039;&lt;br /&gt;
# [[certificate_to_UI | Put your certificate on the UIs]]&lt;br /&gt;
# Register to the VO&lt;br /&gt;
#:* &#039;&#039;&#039;CMS&#039;&#039;&#039;&lt;br /&gt;
#:*# [[Register_to_the_CMS_VO|Register to the CMS VO]]&lt;br /&gt;
#:*# [[SiteDB | Check if your certificate is ok on SiteDB]]. Note the DN.&lt;br /&gt;
#:*# [[CERN_certificate_management | Check that your certificate is the only one registered on the CERN website.]]&lt;br /&gt;
#:* &#039;&#039;&#039;IceCube&#039;&#039;&#039;&lt;br /&gt;
#:** [[Register_to_the_IceCube_VO|Register to the IceCube VO]]&lt;br /&gt;
#:* &#039;&#039;&#039;Solid&#039;&#039;&#039;, &#039;&#039;&#039;Others&#039;&#039;&#039;&lt;br /&gt;
#:** [[Register_to_the_Beapps_VO|Register to the Belgian VO (beapps) ]]&lt;br /&gt;
# Send a mail to the T2B support (grid_adminATlistserv.vub.ac.be) with your DN in order to have write access on the T2.&lt;br /&gt;
# [[Check_Certificate_UIs | Check if everything works fine on the mX machines]]&lt;br /&gt;
&lt;br /&gt;
=== Certificate renewal ===&lt;br /&gt;
As we have switched to a new certificate provider, you need to follow the &#039;First time certificate&#039; steps to get a new certificate, at least until December 2016.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Register_to_the_Beapps_VO&amp;diff=746</id>
		<title>Register to the Beapps VO</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Register_to_the_Beapps_VO&amp;diff=746"/>
		<updated>2016-11-04T12:45:25Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: Created page with &amp;quot;* Register with your certificate here :  https://voms01.begrid.be:8443/voms/beapps/user/home.action  * Test on m machines :  &amp;lt;pre&amp;gt; voms-proxy-init --voms beapps &amp;lt;/pre&amp;gt;&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* Register with your certificate here :&lt;br /&gt;
&lt;br /&gt;
https://voms01.begrid.be:8443/voms/beapps/user/home.action&lt;br /&gt;
&lt;br /&gt;
* Test on m machines :&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
voms-proxy-init --voms beapps&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=745</id>
		<title>Getting a certificate for the T2</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=745"/>
		<updated>2016-11-04T12:42:05Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* First time certificate */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;If you need grid access on the T2, please follow all the steps below: &amp;lt;br&amp;gt; &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== First time certificate ===&lt;br /&gt;
# [[Obtaining_a_certificate | Get a certificate (new)]]. &#039;&#039;If you already have a CERN grid certificate, you can use it temporarily, but for accounting reasons we need you to get a Belgian certificate.&#039;&#039;&lt;br /&gt;
# Extract your certificate from your browser: &#039;&#039;&#039;[[Chrome]]&#039;&#039;&#039; | &#039;&#039;&#039;[[Firefox]]&#039;&#039;&#039;&lt;br /&gt;
# [[certificate_to_UI | Put your certificate on the UIs]]&lt;br /&gt;
# Register to the VO&lt;br /&gt;
#:* CMS&lt;br /&gt;
#:*# [[Register_to_the_CMS_VO|Register to the CMS VO]]&lt;br /&gt;
#:*# [[SiteDB | Check if your certificate is ok on SiteDB]]. Note the DN.&lt;br /&gt;
#:*# [[CERN_certificate_management | Check that your certificate is the only one registered on the CERN website.]]&lt;br /&gt;
#:* IceCube&lt;br /&gt;
#:** [[Register_to_the_IceCube_VO|Register to the IceCube VO]]&lt;br /&gt;
#:* Solid, Others&lt;br /&gt;
#:** [[Register_to_the_Beapps_VO|Register to the Belgian VO (beapps) ]]&lt;br /&gt;
# Send a mail to the T2B support (grid_adminATlistserv.vub.ac.be) with your DN in order to have write access on the T2.&lt;br /&gt;
# [[Check_Certificate_UIs | Check if everything works fine on the mX machines]]&lt;br /&gt;
&lt;br /&gt;
=== Certificate renewal ===&lt;br /&gt;
As we have switched to a new certificate provider, you need to follow the &#039;First time certificate&#039; steps to get a new certificate, at least until December 2016.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=744</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=744"/>
		<updated>2016-11-03T15:04:21Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Job Deletion */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines:&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into these machines.&lt;br /&gt;
::::Servers : mshort [ m0 , m1 , m2 , m3 ] , mlong [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system: it manages all the submitted jobs and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@mshort.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option makes ssh send a keep-alive message to the server after every 100 seconds without traffic, so you should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should look first for information&lt;br /&gt;
* The email address for cluster support (please use it rather than personal mail, so that everyone on the support team can answer and track the progress)&lt;br /&gt;
* The CPU time limit imposed per process, which depends on which of our 2 groups of UIs you are on:&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;mshort.iihe.ac.be :  m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be &amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;mlong.iihe.ac.be : m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under heavy load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data; it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do!&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take note of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user, but for work that you need to share or produce as a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highbw&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid, subgroup of highmem, with WNs having 10Gb/s bandwidth access to storage.&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3512&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1984&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 448&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 2500 jobs &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;  (see [[bigSubmission|here]] if you want to send more) &amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
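As a rough illustration of how the limits in the table translate into a queue choice, here is a minimal shell sketch. The function name pick_queue and its two arguments (memory in GB, walltime in minutes) are hypothetical and not part of the cluster tools; the thresholds are simply copied from the table above.&lt;br /&gt;

```shell
# Hypothetical helper: suggest a queue from the limits listed in the table.
# Usage: pick_queue MEM_GB WALLTIME_MINUTES
pick_queue() {
    mem_gb=$1
    minutes=$2
    max_minutes=$((95 * 60))   # 95 hours walltime limit on the regular queues

    if [ "$mem_gb" -gt 4 ] || [ "$minutes" -gt "$max_minutes" ]; then
        echo "none"            # exceeds every limit in the table
    elif [ "$mem_gb" -gt 2 ]; then
        echo "highmem"         # 4 GB memory limit
    elif [ "$minutes" -le 30 ]; then
        echo "express"         # 30 minutes walltime, 2 GB memory
    else
        echo "localgrid"       # default queue, 2 GB memory
    fi
}

pick_queue 3 600
```

The result could then be passed to qsub, for example with qsub -q "$(pick_queue 3 600)" myjob.sh. The sketch never suggests highbw, since that choice depends on storage bandwidth needs rather than on memory or walltime.&lt;br /&gt;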
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) pass in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-m mailaddress&amp;amp;nbsp;: set mail address (use in conjonction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resources options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
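The options listed above can also be written inside the job script as #PBS directive lines, which Torque reads at submission time. The following is a minimal sketch under that assumption; the job name myjob and the walltime value are placeholders chosen for illustration, not site recommendations.&lt;br /&gt;

```shell
# Write a small job script; the #PBS lines carry the same options as the qsub flags.
job=myjob.sh
printf '%s\n' \
    '#!/bin/bash' \
    '#PBS -q localgrid' \
    '#PBS -N myjob' \
    '#PBS -l nodes=1:ppn=2' \
    '#PBS -l walltime=01:00:00' \
    'echo "Running on $(hostname) in $PBS_O_WORKDIR"' \
    > "$job"

# Submit only where qsub is available (i.e. on a UI).
if command -v qsub > /dev/null; then
    qsub "$job"
fi
```

Keeping the options in the script means the same file can be resubmitted later without retyping the flags.&lt;br /&gt;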
&lt;br /&gt;
&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|25px|line=1|]] If you want to send more than 2500 jobs to the cluster, write all qsub commands in a text file, and use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; (more info [[bigSubmission|here]]).&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show the job repartition on queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes to:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script is provided to analyze the logs and print some statistics for the user:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
=== Job Deletion ===&lt;br /&gt;
Use the following command:&lt;br /&gt;
 qdel &amp;lt;JOBID&amp;gt;&lt;br /&gt;
To delete all your jobs, use the following line (be patient, it can take a while):&lt;br /&gt;
 qdel $(qselect -u $USER -s EHQRTW)&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
There are several areas that we regularly back up: &#039;&#039;&#039;/user&#039;&#039;&#039; , &#039;&#039;&#039;/group&#039;&#039;&#039; , &#039;&#039;&#039;/data&#039;&#039;&#039; , &#039;&#039;&#039;/ice3&#039;&#039;&#039;.&amp;lt;br&amp;gt;&lt;br /&gt;
You can find more information on the backup frequency and how to access the backups [[Backup|here]].&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=CernLxplus&amp;diff=743</id>
		<title>CernLxplus</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=CernLxplus&amp;diff=743"/>
		<updated>2016-10-27T10:29:47Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* How to connect without password to lxplus */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
=== How to connect without password to lxplus ===&lt;br /&gt;
* The CERN wiki page is [http://linux.web.cern.ch/linux/docs/kerberos-access.shtml here]&lt;br /&gt;
* On &#039;&#039;&#039;Ubuntu&#039;&#039;&#039;, you need to:&lt;br /&gt;
** Install Kerberos, entering CERN.CH when asked for the domain:&lt;br /&gt;
 sudo apt-get install krb5-user&lt;br /&gt;
:* Copy to &#039;&#039;&#039;/etc/krb5.conf&#039;&#039;&#039; the config file taken from CERN [http://linux.web.cern.ch/linux/docs/krb5.conf here].&lt;br /&gt;
:* Run &#039;&#039;&#039;kinit&#039;&#039;&#039;. It should ask for your CERN password, showing yourlogin@CERN.CH. If the login shown is wrong, specify it yourself with &#039;&#039;&#039;kinit yourlogin@CERN.CH&#039;&#039;&#039;&lt;br /&gt;
::: This should give you a valid Kerberos ticket lasting 25h, meaning you will be able to connect to lxplus without a password as long as the ticket is valid.&lt;br /&gt;
::: To check that this worked, run &#039;&#039;&#039;klist -f&#039;&#039;&#039; and look at the &#039;&#039;Flags&#039;&#039;: there should be an &#039;&#039;&#039;F&#039;&#039;&#039;, meaning you will have access to your /afs space.&lt;br /&gt;
:* Connect to lxplus using the following ssh options (replace &#039;&#039;MYLOGIN&#039;&#039; with your CERN login):&lt;br /&gt;
 ssh -2 -oGSSAPIAuthentication=yes -oGSSAPIDelegateCredentials=yes -oGSSAPITrustDNS=yes -l MYLOGIN lxplus.cern.ch&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt; &amp;lt;br&amp;gt;&lt;br /&gt;
* From the UIs, you should simply do:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
kinit&lt;br /&gt;
ssh -2 -oGSSAPIAuthentication=yes -oGSSAPIDelegateCredentials=yes -oGSSAPITrustDNS=yes -oStrictHostKeyChecking=no -l $USER $(nslookup lxplus.cern.ch|grep Address|sed -n 2,2p|awk -F&#039;:&#039; &#039;{print $2}&#039;)&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== How to copy data from disk (e.g. beo5) to your space at lxplus? ===&lt;br /&gt;
Use secure copy: scp&lt;br /&gt;
 &amp;lt;pre&amp;gt; &lt;br /&gt;
scp  &#039;uname&#039;@lxpub3.iihe.ac.be:/beo5/somefile &#039;uname&#039;@lxplus.cern.ch:/afs/cern.ch/user/m/me &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== How to print from lxplus when you&#039;re in building 40 ===&lt;br /&gt;
Command to print &lt;br /&gt;
 &amp;lt;pre&amp;gt; &lt;br /&gt;
lpr -P 40-4B-COR .... &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
*[https://winservices.web.cern.ch/winservices/Services/Printers/PrinterList.aspx List] of available printers at CERN.&lt;br /&gt;
*[http://linux.web.cern.ch/linux/documentation/printing.shtml More] information.&lt;br /&gt;
*[http://cerncourier.com/cws/article/cnl/26307 Article] on printing&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=742</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=742"/>
		<updated>2016-10-26T14:08:46Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* How to Connect */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into these machines.&lt;br /&gt;
::::Servers : mshort [ m0 , m1 , m2 , m3 ] , mlong [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@mshort.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
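To avoid retyping these options, they can be stored in the OpenSSH client configuration. The sketch below prints a config stanza equivalent to the command above; the alias t2b and the function name ui_ssh_stanza are hypothetical, and username stands for your own cluster login.&lt;br /&gt;

```shell
# Print an OpenSSH client config stanza for the light-task UIs.
# "t2b" is a hypothetical alias; the argument is your cluster username.
ui_ssh_stanza() {
    printf 'Host t2b\n'
    printf '    HostName mshort.iihe.ac.be\n'
    printf '    User %s\n' "$1"
    printf '    ForwardX11 yes\n'             # same effect as ssh -X
    printf '    ServerAliveInterval 100\n'    # same as -o ServerAliveInterval=100
}

ui_ssh_stanza username
```

After checking the output, it could be appended with ui_ssh_stanza username >> ~/.ssh/config, so that ssh t2b is enough to connect.&lt;br /&gt;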
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;mshort.iihe.ac.be :  m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be &amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;mlong.iihe.ac.be : m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these indicators is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take note of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but if you need to share/produce in a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs, it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
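As an illustration of using the scratch space from inside a job, here is a minimal sketch. It assumes the job runs under Torque, which sets PBS_O_WORKDIR to the directory the job was submitted from; the file name output.txt and the echo line are placeholders for a real computation.&lt;br /&gt;

```shell
# Work in the scratch area, then copy the result back to the submission directory.
# PBS_O_WORKDIR is set by the batch system inside a job; PWD is a fallback for testing.
dest="${PBS_O_WORKDIR:-$PWD}"
workdir=$(mktemp -d "${TMPDIR:-/tmp}/job.XXXXXX")

cd "$workdir" || exit 1
echo "placeholder result" > output.txt    # stands in for the real computation

cp output.txt "$dest/"
cd "$dest" || exit 1
rm -rf "$workdir"
```

Writing intermediate files under $TMPDIR keeps them off /user, where the quota applies, and only the final result is copied back.&lt;br /&gt;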
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highbw&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid, subgroup of highmem, with WNs having 10Gb/s bandwidth access to storage.&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3512&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1984&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 448&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 2500 jobs &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;  (see [[bigSubmission|here]] if you want to send more) &amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) pass in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-m mailaddress&amp;amp;nbsp;: set mail address (use in conjonction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resources options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|25px|line=1|]] If you want to send more than 2500 jobs to the cluster, write all qsub commands in a text file, and use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; (more info [[bigSubmission|here]]).&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show the job repartition on queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes to:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script is provided to analyze the logs and print some statistics for the user:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
=== Job Deletion ===&lt;br /&gt;
Use the following command:&lt;br /&gt;
 qdel &amp;lt;JOBID&amp;gt;&lt;br /&gt;
To delete all your jobs, use the following line (be patient, it can take a while):&lt;br /&gt;
 qstat localgrid@cream02 |grep $USER|awk &#039;{print &amp;quot;qdel &amp;quot;$1}&#039;|xargs -0 bash -c&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
There are several areas that we regularly back up: &#039;&#039;&#039;/user&#039;&#039;&#039; , &#039;&#039;&#039;/group&#039;&#039;&#039; , &#039;&#039;&#039;/data&#039;&#039;&#039; , &#039;&#039;&#039;/ice3&#039;&#039;&#039;.&amp;lt;br&amp;gt;&lt;br /&gt;
You can find more information on the backup frequency and how to access the backups [[Backup|here]].&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=741</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=741"/>
		<updated>2016-10-26T14:05:15Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into these machines.&lt;br /&gt;
::::Servers : mshort [ m0 , m1 , m2 , m3 ] , mlong [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these indicators is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take note of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but if you need to share/produce in a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs, it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highbw&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid, subgroup of highmem, with WNs having 10Gb/s bandwidth access to storage.&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3512&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1984&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 448&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 2500 jobs &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;  (see [[bigSubmission|here]] if you want to send more) &amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set the mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
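&lt;br /&gt;
The options above can be combined on one command line. The sketch below only prints the resulting command so that it can be checked before submission; the helper name build_qsub and the job name myanalysis are made up for illustration.&lt;br /&gt;

```shell
# Hypothetical helper: prints (does not run) a qsub command line built from
# the options listed above. Remove the leading "echo" to actually submit.
# usage: build_qsub QUEUE JOBNAME CORES SCRIPT
build_qsub() {
    queue=$1     # queue name, e.g. localgrid, highmem, highbw or express
    name=$2      # job name passed to -N
    cores=$3     # number of cores requested on a single node
    script=$4    # path to the job script
    echo qsub -q "$queue" -N "$name" -l "nodes=1:ppn=$cores" "$script"
}

build_qsub localgrid myanalysis 2 myjob.sh
# prints: qsub -q localgrid -N myanalysis -l nodes=1:ppn=2 myjob.sh
```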
&lt;br /&gt;
&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|25px|line=1|]] If you want to send more than 2500 jobs to the cluster, write all qsub commands in a text file, and use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; (more info [[bigSubmission|here]]).&lt;br /&gt;
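&lt;br /&gt;
One possible way to prepare such a text file is a small loop that writes one qsub line per job script. This is only a sketch: the helper name make_qsub_list is hypothetical, and the file name myqsubcommands.txt and the script paths are placeholders.&lt;br /&gt;

```shell
# Hypothetical helper: writes one qsub line per job script to a command file
# that can then be passed to big-submission.
# usage: make_qsub_list QUEUE OUTFILE SCRIPT...
make_qsub_list() {
    queue=$1
    outfile=$2
    shift 2
    : > "$outfile"                       # start from an empty file
    for script in "$@"; do
        printf 'qsub -q %s %s\n' "$queue" "$script" >> "$outfile"
    done
}

make_qsub_list localgrid myqsubcommands.txt /path/to/myscript1.sh /path/to/myscript2.sh
cat myqsubcommands.txt
# then, on a UI: big-submission myqsubcommands.txt
```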
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how jobs are distributed across the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script is provided to analyze the logs and print per-user statistics:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
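&lt;br /&gt;
The per-job lines of this output can be filtered with standard tools. As an illustration, the hypothetical helper below lists the jobs whose exit code is not 0. It assumes the per-job layout shown in the sample above, and the second sample line is invented for the demonstration.&lt;br /&gt;

```shell
# Hypothetical filter: reads torque-user-info.py output on stdin and prints
# "jobid exitcode" for every per-job line whose exit code is not 0.
# Assumes per-job lines of the form "ID: jobid ExCode: code ..." as in the sample.
failed_jobs() {
    awk '$1 == "ID:" { if ($4 != 0) print $2, $4 }'
}

# Demonstration on two sample lines (the second one is made up).
# On a UI the input would come from: /group/log/torque/torque-user-info.py
printf '%s\n' \
    'ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN' \
    'ID: 6077999  ExCode:   1 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN' \
    | failed_jobs
# prints: 6077999 1
```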
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
=== Job Deletion ===&lt;br /&gt;
Use the following command:&lt;br /&gt;
 qdel &amp;lt;JOBID&amp;gt;&lt;br /&gt;
To delete all your jobs, be patient while using the following line:&lt;br /&gt;
 qstat localgrid@cream02 |grep $USER|awk &#039;{print &amp;quot;qdel &amp;quot;$1}&#039;|xargs -0 bash -c&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
There are several areas that we regularly back up: &#039;&#039;&#039;/user&#039;&#039;&#039; , &#039;&#039;&#039;/group&#039;&#039;&#039; , &#039;&#039;&#039;/data&#039;&#039;&#039; , &#039;&#039;&#039;/ice3&#039;&#039;&#039;.&amp;lt;br&amp;gt;&lt;br /&gt;
You can find more information on the backup frequency and how to access them [[Backup|here]].&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=729</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=729"/>
		<updated>2016-09-20T14:46:01Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Queues */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into one of these machines&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store large amounts of data; it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do!&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take note of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but for sharing or producing files within a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : temporary scratch space for your job. On the WNs, use $TMPDIR; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into the following queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highbw&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid, subgroup of highmem, with WNs having 10Gb/s bandwidth access to storage.&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3512&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1984&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 448&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;3&amp;quot;| 2500 jobs &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;  (see [[bigSubmission|here]] if you want to send more) &amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set the mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|25px|line=1|]] If you want to send more than 2500 jobs to the cluster, write all qsub commands in a text file, and use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; (more info [[bigSubmission|here]]).&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how jobs are distributed across the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script is provided to analyze the logs and print per-user statistics:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
=== Job Deletion ===&lt;br /&gt;
Use the following command:&lt;br /&gt;
 qdel &amp;lt;JOBID&amp;gt;&lt;br /&gt;
To delete all your jobs, be patient while using the following line:&lt;br /&gt;
 qstat localgrid@cream02 |grep $USER|awk &#039;{print &amp;quot;qdel &amp;quot;$1}&#039;|xargs -0 bash -c&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
There are several areas that we regularly back up: &#039;&#039;&#039;/user&#039;&#039;&#039; , &#039;&#039;&#039;/group&#039;&#039;&#039; , &#039;&#039;&#039;/data&#039;&#039;&#039; , &#039;&#039;&#039;/ice3&#039;&#039;&#039;.&amp;lt;br&amp;gt;&lt;br /&gt;
You can find more information on the backup frequency and how to access them [[Backup|here]].&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=BigSubmission&amp;diff=725</id>
		<title>BigSubmission</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=BigSubmission&amp;diff=725"/>
		<updated>2016-09-12T09:25:50Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Submitting more than 2500 jobs to the cluster */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Submitting more than 2500 jobs to the cluster ==&lt;br /&gt;
&lt;br /&gt;
Use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; on the UIs. Here&#039;s the help:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;gt; big-submission -h&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
  This script will execute qsub lines taken from a file, making sure there&#039;s&lt;br /&gt;
  at least 1500 jobs present every 900 seconds for the user on the cluster.&lt;br /&gt;
&lt;br /&gt;
USAGE:&lt;br /&gt;
  big-submission myqsubcommands.txt&lt;br /&gt;
&lt;br /&gt;
  with myqsubcommands.txt looking like:&lt;br /&gt;
qsub -q localgrid /path/to/myscript1.sh&lt;br /&gt;
qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
 # qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
&lt;br /&gt;
  big-submission -h : will print this help.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
DETAILS:&lt;br /&gt;
&lt;br /&gt;
  * The script works in the background, so you can deconnect from your session&lt;br /&gt;
    while it continues sending jobs.&lt;br /&gt;
  * Commented lines starting with &#039;#&#039; as well as empty lines will be omitted.&lt;br /&gt;
  * Each time a job is sent, it is deleted from myqsubcommands.txt,&lt;br /&gt;
    ie the file always contains jobs not sent. This way, if you kill&lt;br /&gt;
    big-submission, you know what&#039;s left to send.&lt;br /&gt;
  * The lines that failed will be tried once more at the end.&lt;br /&gt;
  * The script is meant to run once per user. If you want to add&lt;br /&gt;
    some commands to send, [stop big-submission], add them to &lt;br /&gt;
    myqsubcommands.txt, [restart big-submission].&lt;br /&gt;
    Editing the command list live is possible although not recommended,&lt;br /&gt;
    and should be done fast, with for instance `cat file2 &amp;gt;&amp;gt; myqsubcommands.txt`&lt;br /&gt;
  * To read the logs, one suggests using `less -r logfile`, this way colors will be&lt;br /&gt;
    printed nicely.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=724</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=724"/>
		<updated>2016-09-08T15:25:56Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Job Statistics */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into one of these machines&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. THere is no backups there, so be careful of what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user, but for sharing or producing files within a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
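&lt;br /&gt;
The scratch-space advice above can be sketched in Python as follows. This is only an illustration: the function name run_in_scratch, the file name result.txt and the fallback to the system temporary directory are assumptions, not part of the cluster setup.&lt;br /&gt;

```python
import os
import shutil
import tempfile


def run_in_scratch(output_dir, payload="done\n"):
    """Write a result file in scratch space, then copy it to output_dir."""
    # On the worker nodes $TMPDIR points to the per-job scratch area;
    # fall back to the system default when the variable is not set.
    scratch_root = os.environ.get("TMPDIR", tempfile.gettempdir())
    work_dir = tempfile.mkdtemp(prefix="job_", dir=scratch_root)
    try:
        scratch_file = os.path.join(work_dir, "result.txt")
        with open(scratch_file, "w") as handle:
            handle.write(payload)
        # Scratch is cleaned after the job, so copy anything worth keeping.
        os.makedirs(output_dir, exist_ok=True)
        return shutil.copy(scratch_file, output_dir)
    finally:
        # Remove the working directory even if the job code failed.
        shutil.rmtree(work_dir, ignore_errors=True)
```

Heavy temporary I/O then stays on the local scratch disk, and only the final result is written to permanent storage such as /user or /group.&lt;br /&gt;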
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPU&#039;s (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3064&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1536&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;  (see [[bigSubmission|here]] if you want to send more) &amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resources options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
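&lt;br /&gt;
The options above can be combined into a single command line. The Python sketch below only builds the command string and does not submit anything; the helper name build_qsub_command is hypothetical, and it uses the capital -M flag of Torque qsub for the mail address together with -m for the mail events.&lt;br /&gt;

```python
import shlex


def build_qsub_command(script, queue, name=None, cores=None, mail=None, events=None):
    """Assemble a qsub command line from the options listed above."""
    args = ["qsub", "-q", queue]  # the queue is mandatory on this cluster
    if name:
        args += ["-N", name]
    if cores:
        # Request several cores on one node, as in the ppn example above.
        args += ["-l", "nodes=1:ppn={}".format(cores)]
    if mail:
        args += ["-M", mail]
    if events:
        args += ["-m", events]  # any combination of a, b, e
    args.append(script)
    # Quote every argument so that unusual paths stay a single shell word.
    return " ".join(shlex.quote(arg) for arg in args)


print(build_qsub_command("myjob.sh", "localgrid", name="test", cores=2))
```

Printing the command before running it is a cheap way to check the options, in particular the mandatory queue name.&lt;br /&gt;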
&lt;br /&gt;
&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|25px|line=1|]] If you want to send more than 2500 jobs to the cluster, write all qsub commands in a text file, and use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; (more info [[bigSubmission|here]]).&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
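The text file expected by big-submission holds one qsub command per line. A minimal Python sketch for generating it is given below; the helper name write_qsub_commands and the script paths are placeholders, and the line format follows the example in the big-submission help.&lt;br /&gt;

```python
def write_qsub_commands(scripts, list_path, queue="localgrid"):
    """Write one qsub line per script to list_path and return the line count."""
    lines = ["qsub -q {} {}".format(queue, script) for script in scripts]
    with open(list_path, "w") as handle:
        handle.write("\n".join(lines) + "\n")
    return len(lines)


if __name__ == "__main__":
    # Hypothetical job scripts; replace them with your own paths.
    scripts = ["/path/to/myscript{}.sh".format(i) for i in range(1, 4)]
    count = write_qsub_commands(scripts, "myqsubcommands.txt")
    print("wrote {} commands".format(count))
```

The resulting file can then be passed to the script as big-submission myqsubcommands.txt.&lt;br /&gt;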
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how the jobs are distributed over the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script to analyze the logs and provide some statistics for the user is provided:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
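&lt;br /&gt;
To post-process this output, the per-job lines can be parsed with a regular expression. The Python sketch below is an assumption-based example: the pattern is inferred from the sample lines shown above and not from the script itself, and the memory unit is assumed to be MB.&lt;br /&gt;

```python
import re

# Pattern for the per-job lines printed by torque-user-info.py,
# inferred from the sample output (an assumption, not a specification).
JOB_LINE = re.compile(
    r"ID:\s*(\d+)\s+ExCode:\s*(-?\d+)\s+Mem:\s*(\d+)M\s+"
    r"cpuT:\s*(\d+)s\s+wallT:\s*(\d+)s\s+eff:\s*([\d.]+)%\s+(\S+)"
)


def parse_job_line(line):
    """Return a dict for one per-job line, or None if it does not match."""
    match = JOB_LINE.search(line)
    if match is None:
        return None
    job_id, code, mem, cpu, wall, eff, name = match.groups()
    return {
        "id": int(job_id),
        "exit_code": int(code),
        "mem_mb": int(mem),  # unit assumed to be MB
        "cpu_s": int(cpu),
        "wall_s": int(wall),
        "eff_percent": float(eff),
        "name": name,
    }


sample = "ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN"
print(parse_job_line(sample))
```

Separator and summary lines return None, so they can be skipped when looping over the whole output.&lt;br /&gt;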
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
=== Job Deletion ===&lt;br /&gt;
Use the following command:&lt;br /&gt;
 qdel &amp;lt;JOBID&amp;gt;&lt;br /&gt;
To delete all your jobs, be patient while using the following line:&lt;br /&gt;
 qstat localgrid@cream02 |grep $USER|awk &#039;{print &amp;quot;qdel &amp;quot;$1}&#039;|xargs -0 bash -c&lt;br /&gt;
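&lt;br /&gt;
Before deleting everything, it can help to look at the list of qdel commands first. The Python sketch below mirrors the grep and awk steps of the line above as a dry run: it only builds the commands and never executes them. The function name qdel_commands and the sample qstat text are invented for illustration.&lt;br /&gt;

```python
def qdel_commands(qstat_output, user):
    """Return one qdel command per qstat line that mentions user."""
    commands = []
    for line in qstat_output.splitlines():
        fields = line.split()
        # Same logic as the grep/awk pipeline: keep lines containing the
        # user name and take the first column as the job ID.
        if fields and user in line:
            commands.append("qdel " + fields[0])
    return commands


if __name__ == "__main__":
    # Invented sample in the usual qstat column layout, for illustration only.
    sample = (
        "6077555.cream02  STDIN  rougny  00:00:00 R localgrid\n"
        "6077556.cream02  STDIN  someone 00:00:00 R localgrid\n"
    )
    for command in qdel_commands(sample, "rougny"):
        print(command)
```

As with grep, any line containing the user name is selected, so it is worth checking the printed list before running the real deletion.&lt;br /&gt;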
&lt;br /&gt;
== Backup ==&lt;br /&gt;
There are several areas that we regularly back up: &#039;&#039;&#039;/user&#039;&#039;&#039; , &#039;&#039;&#039;/group&#039;&#039;&#039; , &#039;&#039;&#039;/data&#039;&#039;&#039; , &#039;&#039;&#039;/ice3&#039;&#039;&#039;.&amp;lt;br&amp;gt;&lt;br /&gt;
You can find more information on the backup frequency and how to access them [[Backup|here]].&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=BigSubmission&amp;diff=723</id>
		<title>BigSubmission</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=BigSubmission&amp;diff=723"/>
		<updated>2016-09-08T08:20:35Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Submitting more than 2500 jobs to the cluster */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Submitting more than 2500 jobs to the cluster ==&lt;br /&gt;
&lt;br /&gt;
Use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; on the UIs. Here&#039;s the help:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;gt; big-submission -h&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
  This script will execute qsub lines taken from a file, making sure there&#039;s&lt;br /&gt;
  at least 1500 jobs present every 900 seconds for the user on the cluster.&lt;br /&gt;
&lt;br /&gt;
USAGE:&lt;br /&gt;
  big-submission myqsubcommands.txt&lt;br /&gt;
&lt;br /&gt;
  with myqsubcommands.txt looking like:&lt;br /&gt;
qsub -q localgrid /path/to/myscript1.sh&lt;br /&gt;
qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
 # qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
&lt;br /&gt;
  big-submission -h : will print this help.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
DETAILS:&lt;br /&gt;
&lt;br /&gt;
  * Commented lines starting with &#039;#&#039; as well as empty lines will be omitted.&lt;br /&gt;
  * Each time a job is sent, it is deleted from myqsubcommands.txt,&lt;br /&gt;
    ie the file always contains jobs not sent. This way, if you kill&lt;br /&gt;
    big-submission, you know what&#039;s left to send.&lt;br /&gt;
  * The lines that failed will be tried once more at the end.&lt;br /&gt;
  * The script is meant to run once per user. If you want to add&lt;br /&gt;
    some commands to send, [stop big-submission], add them to &lt;br /&gt;
    myqsubcommands.txt, [restart big-submission].&lt;br /&gt;
    Editing the command list live is possible although not recommended,&lt;br /&gt;
    and should be done fast, with for instance `cat file2 &amp;gt;&amp;gt; myqsubcommands.txt`&lt;br /&gt;
  * To read the logs, I suggest using `less -r logfile`, this way colors will be&lt;br /&gt;
    printed nicely.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=BigSubmission&amp;diff=716</id>
		<title>BigSubmission</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=BigSubmission&amp;diff=716"/>
		<updated>2016-08-25T16:02:20Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Submitting more than 2500 jobs to the cluster */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Submitting more than 2500 jobs to the cluster ==&lt;br /&gt;
&lt;br /&gt;
Use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; on the UIs. Here&#039;s the help:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;gt; big-submission -h&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
  This script will execute qsub lines taken from a file, making sure there&#039;s&lt;br /&gt;
  at least 1500 jobs present every 900 seconds for the user on the cluster.&lt;br /&gt;
&lt;br /&gt;
USAGE:&lt;br /&gt;
  big-submission myqsubcommands.txt&lt;br /&gt;
&lt;br /&gt;
  with myqsubcommands.txt looking like:&lt;br /&gt;
qsub -q localgrid /path/to/myscript1.sh&lt;br /&gt;
qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
# qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
&lt;br /&gt;
  big-submission -h : will print this help.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
DETAILS:&lt;br /&gt;
&lt;br /&gt;
  * Commented lines starting with &#039;#&#039; will be omitted.&lt;br /&gt;
  * Each time a job is sent, it is deleted from myqsubcommands.txt,&lt;br /&gt;
    ie the file always contains jobs not sent. This way, if you kill&lt;br /&gt;
    big-submission, you know what&#039;s left to send.&lt;br /&gt;
  * The script is meant to run once per user. If you want to add&lt;br /&gt;
    some commands to send, [stop big-submission], add them to &lt;br /&gt;
    myqsubcommands.txt, [restart big-submission].&lt;br /&gt;
    Editing the command list live is possible although not recommended,&lt;br /&gt;
    and should be done fast, with for instance `cat file2 &amp;gt;&amp;gt; myqsubcommands.txt`&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Main_Page&amp;diff=715</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Main_Page&amp;diff=715"/>
		<updated>2016-08-11T13:08:45Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Using the Tier2 computing resources */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Welcome to the CMS Belgian T2 Wiki ==&lt;br /&gt;
&amp;lt;center&amp;gt;&amp;lt;span style=&amp;quot;font-size: 300%;&amp;quot;&amp;gt; [[first_access_to_t2b|=&amp;gt; FIRST ACCESS TO T2B &amp;lt;=]] &amp;lt;/span&amp;gt;&amp;lt;/center&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== General information for users ===&lt;br /&gt;
*[[Faq_t2b | FAQ]]&lt;br /&gt;
*[[ Cluster_Presentation | Cluster Presentation ]]&lt;br /&gt;
==== Information for new users ====&lt;br /&gt;
*[[T2bSupport| T2B Support guidelines]]&lt;br /&gt;
*[[First_access_to_t2b|Getting access to T2B]]&lt;br /&gt;
*[[Getting_a_certificate_for_the_T2|Certificates and VOs]]&lt;br /&gt;
&lt;br /&gt;
==== Using the Tier2 computing resources ====&lt;br /&gt;
*[[policies| Policies concerning the usage of computing resources.]]&lt;br /&gt;
*[[CurrentStatus| Current status of the Tier2]]&lt;br /&gt;
*[[Getting_started_with_the_CMSSW_software| Getting started with the CMSSW software]]&lt;br /&gt;
*[[Using_Git| Using Git]]&lt;br /&gt;
*[[Getting_started_with_the_MadGraph_software| Getting started with the MadGraph software]]&lt;br /&gt;
*Submitting jobs with CRAB&lt;br /&gt;
**[[gridSubmission_withCrab| To the worldwide grid]]&lt;br /&gt;
*Submitting jobs without CRAB&lt;br /&gt;
**[[localSubmission| To the local resources]]&lt;br /&gt;
::* [[bigSubmission|Sending unlimited number of Jobs]]&lt;br /&gt;
:*[[gridSubmission| To the worldwide grid]]&lt;br /&gt;
*[[GridStorageAccess| How to handle data on Grid storage]]&lt;br /&gt;
*[[FAQ_CMSSW_on_the_Grid| FAQ CMSSW on the Grid on proxy and more!]]&lt;br /&gt;
*[[OtherSoftware| Other software available at the T2]]&lt;br /&gt;
&lt;br /&gt;
==== Back-up procedures ====&lt;br /&gt;
*[[BackupDocsLinuxLaptop| Procedure to automate backups of personal documents (Linux laptops only)]]&lt;br /&gt;
*[[Backup| Backups of /user , /group , /data , /ice3]]&lt;br /&gt;
*[[AccidentalDeleteOfFiles| What to do if I have accidentally deleted some files on my personal computer ?]]&lt;br /&gt;
&lt;br /&gt;
==== Other topics ====&lt;br /&gt;
*[[User_Meetings]] list of user meetings with added transparencies&lt;br /&gt;
*[[Basic_computing_skills| Basic computing skills]]&lt;br /&gt;
*[[CernLxplus| Useful info on use of lxplus.cern.ch]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Dedicated twiki pages maintained by several research groups ===&lt;br /&gt;
*[[TtBar_Analysis_Framework| TtBar Analysis Framework (old)]]&lt;br /&gt;
*[[TopQuarkGroup| Top Quark Group wiki]]&lt;br /&gt;
*[[HEEP_Analysis_Framework| HEEP Analysis Framework]]&lt;br /&gt;
*[[V0_Analysis_wiki| V0 Analysis wiki]]&lt;br /&gt;
*[[Info_exchange| Higgs analysis]]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Service work for CMS ===&lt;br /&gt;
*[[DDT]]&lt;br /&gt;
*[[CMSSWDeployment]]&lt;br /&gt;
*[[ProdAgentAllUsers| Prodagent for users]]&lt;br /&gt;
*[[TestStoreTemp| Writing tests in /store/temp/user on T2 SE&#039;s]]&lt;br /&gt;
*[[CrabServerInstall| CRAB Server installation]]&lt;br /&gt;
*[[CrabValidation| Basic validation of CRAB releases]]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Obsolete twiki pages ===&lt;br /&gt;
*[[DIY-UI]]&lt;br /&gt;
*[[CrabIIHETransitionSLC4ToSLC5| Using CRAB at IIHE during the transition from SLC4 to SLC5]]&lt;br /&gt;
*[[DataSamplesRequests2007| Samples Requests for 2007]]&lt;br /&gt;
*[[DataSamplesRequests2008| Samples Requests for 2008]]&lt;br /&gt;
*[[CrabNewIIHE| Crab at IIHE]]&lt;br /&gt;
*[[OldMainPage| The old main page is kept here for reference]]&lt;br /&gt;
&lt;br /&gt;
== Admin section ==&lt;br /&gt;
*[[AdminPage| Pages for administrators]]&lt;br /&gt;
&lt;br /&gt;
== Getting started with MediaWiki ==&lt;br /&gt;
*[//meta.wikimedia.org/wiki/Help:Contents User&#039;s Guide]&lt;br /&gt;
*[//www.mediawiki.org/wiki/Manual:Configuration_settings Configuration settings list]&lt;br /&gt;
*[//www.mediawiki.org/wiki/Manual:FAQ MediaWiki FAQ]&lt;br /&gt;
*[https://lists.wikimedia.org/mailman/listinfo/mediawiki-announce MediaWiki release mailing list]&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=714</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=714"/>
		<updated>2016-08-11T12:33:56Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Job submission */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into those machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The CPU time limit imposed per process, as our UIs are divided into 2 groups:&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of the indicators is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under heavy load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do!&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user, but for sharing or producing files within a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPU&#039;s (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3064&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1536&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;  (see [[bigSubmission|here]] if you want to send more) &amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resources options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|25px|line=1|]] If you want to send more than 2500 jobs to the cluster, write all qsub commands in a text file, and use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; (more info [[bigSubmission|here]]).&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how the jobs are distributed over the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script to analyze the logs and provide some statistics for the user is provided:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
There are several areas that we regularly back up: &#039;&#039;&#039;/user&#039;&#039;&#039; , &#039;&#039;&#039;/group&#039;&#039;&#039; , &#039;&#039;&#039;/data&#039;&#039;&#039; , &#039;&#039;&#039;/ice3&#039;&#039;&#039;.&amp;lt;br&amp;gt;&lt;br /&gt;
You can find more information on the backup frequency and how to access them [[Backup|here]].&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=BigSubmission&amp;diff=713</id>
		<title>BigSubmission</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=BigSubmission&amp;diff=713"/>
		<updated>2016-08-11T12:29:00Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: Created page with &amp;quot;== Submitting more than 2500 jobs to the cluster ==  Use the script &amp;#039;&amp;#039;&amp;#039;big-submission&amp;#039;&amp;#039;&amp;#039; on the UIs. Here&amp;#039;s the help: &amp;lt;pre&amp;gt; &amp;gt; big-submission -h     This script will execute qs...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Submitting more than 2500 jobs to the cluster ==&lt;br /&gt;
&lt;br /&gt;
Use the script &#039;&#039;&#039;big-submission&#039;&#039;&#039; on the UIs. Here&#039;s the help:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;gt; big-submission -h&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
  This script will execute qsub lines taken from a file, making sure there&#039;s&lt;br /&gt;
  at least 1500 jobs present every 900 seconds for the user on the cluster.&lt;br /&gt;
&lt;br /&gt;
Usage:&lt;br /&gt;
  big-submission myqsubcommands.txt&lt;br /&gt;
&lt;br /&gt;
  with myqsubcommands.txt looking like:&lt;br /&gt;
qsub -q localgrid /path/to/myscript1.sh&lt;br /&gt;
qsub -q localgrid /path/to/myscript2.sh&lt;br /&gt;
&lt;br /&gt;
  big-submission -h : will print this help.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=712</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=712"/>
		<updated>2016-08-11T12:24:50Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Queues */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into one of these machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these indicators is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but if you need to share/produce in a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
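&lt;br /&gt;
A minimal sketch of a job body that works in the scratch space and copies its result back is given here. It assumes $TMPDIR is set on the WN as described in the list; the fallback to /tmp, the file name result.txt and the choice of the submission directory as destination are illustrative assumptions, not part of the cluster setup.&lt;br /&gt;

```shell
#!/bin/bash
# Sketch of a job body that works in scratch space and copies its result back.
# $TMPDIR is expected on the WNs; the /tmp fallback and the file name
# result.txt are assumptions so that the sketch also runs elsewhere.
submitdir="$PWD"
workdir="$(mktemp -d "${TMPDIR:-/tmp}/myjob.XXXXXX")"

cd "$workdir" || exit 1
echo "placeholder result" > result.txt    # stands in for the real computation

cp result.txt "$submitdir/result.txt"     # copy back only what must be kept
cd "$submitdir" || exit 1
rm -rf "$workdir"                         # remove the private working directory
```

Working in a private directory under $TMPDIR keeps intermediate files off the quota-limited /user area.&lt;br /&gt;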
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPU&#039;s (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3064&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1536&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;  (see [[bigSubmission|here]] if you want to send more) &amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
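&lt;br /&gt;
To go beyond the 2500-job limit, the big-submission script reads one qsub line per job from a text file. The following sketch generates such a file; the job directory, the script names and the job count are hypothetical, and localgrid is used only as an example queue.&lt;br /&gt;

```shell
#!/bin/bash
# Write one qsub line per job script into a command file for big-submission.
# The job directory, script names and job count are hypothetical examples.
jobdir="${JOBDIR:-$PWD/jobs}"
njobs="${NJOBS:-3000}"
cmdfile="myqsubcommands.txt"

for i in $(seq 1 "$njobs"); do
    echo "qsub -q localgrid ${jobdir}/myscript${i}.sh"
done > "$cmdfile"

# Show how many commands were written.
wc -l "$cmdfile"
```

The resulting file can then be handed to the script with: big-submission myqsubcommands.txt&lt;br /&gt;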
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) start an interactive job&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resources options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
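&lt;br /&gt;
The options above can be combined on a single command line. The sketch below only assembles and prints the command (a dry run), so it can be tried anywhere; the queue, the job name and the script name are example values.&lt;br /&gt;

```shell
#!/bin/bash
# Assemble a qsub command line from the options listed above and print it.
# Queue, job name and script name are example values, not site defaults.
queue="localgrid"
jobname="mytest"
script="myjob.sh"

cmd="qsub -q $queue -N $jobname -l nodes=1:ppn=2 $script"

# Dry run: print the command instead of submitting it.
echo "$cmd"
```

Once the printed line looks right, it can be typed on a UI to submit the job.&lt;br /&gt;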
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show the distribution of jobs across queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script that analyzes the logs and prints per-user statistics is available:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
There are several areas that we regularly back up: &#039;&#039;&#039;/user&#039;&#039;&#039; , &#039;&#039;&#039;/group&#039;&#039;&#039; , &#039;&#039;&#039;/data&#039;&#039;&#039; , &#039;&#039;&#039;/ice3&#039;&#039;&#039;.&amp;lt;br&amp;gt;&lt;br /&gt;
You can find more information on the backup frequency and how to access them [[Backup|here]].&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Backup&amp;diff=707</id>
		<title>Backup</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Backup&amp;diff=707"/>
		<updated>2016-07-15T14:16:15Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Backups ==&lt;br /&gt;
&lt;br /&gt;
All backups can be found in the &#039;&#039;&#039;/backup&#039;&#039;&#039; directory on the UIs and WNs.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== /user &amp;amp; /group ===&lt;br /&gt;
&lt;br /&gt;
Our zfs server allows us to do regular snapshots:&lt;br /&gt;
* One snapshot is taken every 24 hours at midnight; we keep the last seven (a complete week), e.g.:&lt;br /&gt;
:: &amp;lt;pre&amp;gt;auto:hourly-storage-2016-07-09-00:00&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Another snapshot is taken every 7 days at 3am; we keep the last 4 (about a month), e.g.:&lt;br /&gt;
:: &amp;lt;pre&amp;gt;auto:daily-storage-2016-06-29-03:00&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Additionally, /user is backed up every day to a secondary machine, in case our main fileserver goes down.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=706</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=706"/>
		<updated>2016-07-15T13:13:51Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Backup */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into one of these machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these indicators is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but if you need to share/produce in a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPU&#039;s (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3064&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1536&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) start an interactive job&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resources options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show the distribution of jobs across queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script that analyzes the logs and prints per-user statistics is available:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
There are several areas that we regularly back up: &#039;&#039;&#039;/user&#039;&#039;&#039; , &#039;&#039;&#039;/group&#039;&#039;&#039; , &#039;&#039;&#039;/data&#039;&#039;&#039; , &#039;&#039;&#039;/ice3&#039;&#039;&#039;.&amp;lt;br&amp;gt;&lt;br /&gt;
You can find more information on the backup frequency and how to access them [[Backup|here]].&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Main_Page&amp;diff=705</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Main_Page&amp;diff=705"/>
		<updated>2016-07-15T13:08:48Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Back-up procedures */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Welcome to the CMS Belgian T2 Wiki ==&lt;br /&gt;
&amp;lt;center&amp;gt;&amp;lt;span style=&amp;quot;font-size: 300%;&amp;quot;&amp;gt; [[first_access_to_t2b|=&amp;gt; FIRST ACCESS TO T2B &amp;lt;=]] &amp;lt;/span&amp;gt;&amp;lt;/center&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== General information for users ===&lt;br /&gt;
*[[Faq_t2b | FAQ]]&lt;br /&gt;
*[[ Cluster_Presentation | Cluster Presentation ]]&lt;br /&gt;
==== Information for new users ====&lt;br /&gt;
*[[T2bSupport| T2B Support guidelines]]&lt;br /&gt;
*[[First_access_to_t2b|Getting access to T2B]]&lt;br /&gt;
*[[Getting_a_certificate_for_the_T2|Certificates and VOs]]&lt;br /&gt;
&lt;br /&gt;
==== Using the Tier2 computing resources ====&lt;br /&gt;
*[[policies| Policies concerning the usage of computing resources.]]&lt;br /&gt;
*[[CurrentStatus| Current status of the Tier2]]&lt;br /&gt;
*[[Getting_started_with_the_CMSSW_software| Getting started with the CMSSW software]]&lt;br /&gt;
*[[Using_Git| Using Git]]&lt;br /&gt;
*[[Getting_started_with_the_MadGraph_software| Getting started with the MadGraph software]]&lt;br /&gt;
*Submitting jobs with CRAB&lt;br /&gt;
**[[gridSubmission_withCrab| To the worldwide grid]]&lt;br /&gt;
*Submitting jobs without CRAB&lt;br /&gt;
**[[localSubmission| To the local resources]]&lt;br /&gt;
**[[gridSubmission| To the worldwide grid]]&lt;br /&gt;
*[[GridStorageAccess| How to handle data on Grid storage]]&lt;br /&gt;
*[[FAQ_CMSSW_on_the_Grid| FAQ CMSSW on the Grid on proxy and more!]]&lt;br /&gt;
*[[OtherSoftware| Other software available at the T2]]&lt;br /&gt;
&lt;br /&gt;
==== Back-up procedures ====&lt;br /&gt;
*[[BackupDocsLinuxLaptop| Procedure to automate backups of personal documents (Linux laptops only)]]&lt;br /&gt;
*[[Backup| Backups of /user , /group , /data , /ice3]]&lt;br /&gt;
*[[AccidentalDeleteOfFiles| What to do if I have accidentally deleted some files on my personal computer ?]]&lt;br /&gt;
&lt;br /&gt;
==== Other topics ====&lt;br /&gt;
*[[User_Meetings]] list of user meetings with added transparencies&lt;br /&gt;
*[[Basic_computing_skills| Basic computing skills]]&lt;br /&gt;
*[[CernLxplus| Useful info on use of lxplus.cern.ch]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Dedicated twiki pages maintained by several research groups ===&lt;br /&gt;
*[[TtBar_Analysis_Framework| TtBar Analysis Framework (old)]]&lt;br /&gt;
*[[TopQuarkGroup| Top Quark Group wiki]]&lt;br /&gt;
*[[HEEP_Analysis_Framework| HEEP Analysis Framework]]&lt;br /&gt;
*[[V0_Analysis_wiki| V0 Analysis wiki]]&lt;br /&gt;
*[[Info_exchange| Higgs analysis]]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Service work for CMS ===&lt;br /&gt;
*[[DDT]]&lt;br /&gt;
*[[CMSSWDeployment]]&lt;br /&gt;
*[[ProdAgentAllUsers| Prodagent for users]]&lt;br /&gt;
*[[TestStoreTemp| Writing tests in /store/temp/user on T2 SE&#039;s]]&lt;br /&gt;
*[[CrabServerInstall| CRAB Server installation]]&lt;br /&gt;
*[[CrabValidation| Basic validation of CRAB releases]]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Obsolete twiki pages ===&lt;br /&gt;
*[[DIY-UI]]&lt;br /&gt;
*[[CrabIIHETransitionSLC4ToSLC5| Using CRAB at IIHE during the transition from SLC4 to SLC5]]&lt;br /&gt;
*[[DataSamplesRequests2007| Samples Requests for 2007]]&lt;br /&gt;
*[[DataSamplesRequests2008| Samples Requests for 2008]]&lt;br /&gt;
*[[CrabNewIIHE| Crab at IIHE]]&lt;br /&gt;
*[[OldMainPage| The old main page is kept here for reference]]&lt;br /&gt;
&lt;br /&gt;
== Admin section ==&lt;br /&gt;
*[[AdminPage| Pages for administrators]]&lt;br /&gt;
&lt;br /&gt;
== Getting started with MediaWiki ==&lt;br /&gt;
*[//meta.wikimedia.org/wiki/Help:Contents User&#039;s Guide]&lt;br /&gt;
*[//www.mediawiki.org/wiki/Manual:Configuration_settings Configuration settings list]&lt;br /&gt;
*[//www.mediawiki.org/wiki/Manual:FAQ MediaWiki FAQ]&lt;br /&gt;
*[https://lists.wikimedia.org/mailman/listinfo/mediawiki-announce MediaWiki release mailing list]&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=GridStorageAccess&amp;diff=703</id>
		<title>GridStorageAccess</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=GridStorageAccess&amp;diff=703"/>
		<updated>2016-07-13T15:23:56Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* GFAL */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
This page describes how to handle data stored on grid storage.&lt;br /&gt;
&lt;br /&gt;
== Before starting ==&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|40x30px|line=1|]] Before being able to run these commands you first need to make a valid proxy [[File:Exclamation-mark.jpg|40x30px|line=1|]] &amp;lt;br&amp;gt;&lt;br /&gt;
::: &amp;lt;pre&amp;gt;voms-proxy-init --voms cms&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
== GFAL ==&lt;br /&gt;
GFAL is a wrapper around the latest grid commands. Once you know it, you do not need to learn new commands (such as srm, lcg, etc.) whenever the underlying middleware changes.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
=== gfal-commands ===&lt;br /&gt;
If you want more information on the options that can be used, please use &#039;&#039;&#039;man gfal-command&#039;&#039;&#039; !&lt;br /&gt;
&lt;br /&gt;
Here are all the commands that can be used:&lt;br /&gt;
*&#039;&#039;gfal-ls&#039;&#039;: get information on a file, or list the contents of a directory&lt;br /&gt;
*&#039;&#039;gfal-mkdir&#039;&#039;: create a directory&lt;br /&gt;
*&#039;&#039;gfal-rm&#039;&#039;: removes a file. To remove an entire directory, use -r&lt;br /&gt;
*&#039;&#039;gfal-copy&#039;&#039;: copy files.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
=== Usage ===&lt;br /&gt;
There are 2 types of file url:&lt;br /&gt;
* &#039;&#039;&#039;Distant files&#039;&#039;&#039;: their url is of the type srm://&amp;lt;name_of_server&amp;gt;:&amp;lt;port&amp;gt;/some/path, eg for IIHE:&lt;br /&gt;
 srm://maite.iihe.ac.be:8443/pnfs/iihe/&lt;br /&gt;
* &#039;&#039;&#039;Local files&#039;&#039;&#039;: their url is of the type file://path_of_the_file, eg for IIHE:&lt;br /&gt;
 file:///user/$USER/MyFile.root&lt;br /&gt;
&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|40x30px|line=1|]] Be careful, the number of &#039;&#039;&#039;/&#039;&#039;&#039; is very -very- important [[File:Exclamation-mark.jpg|40x30px|line=1|]]&lt;br /&gt;
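&lt;br /&gt;
The slash count for local files can be sketched as follows: the &#039;&#039;file://&#039;&#039; prefix is followed by an absolute path that itself starts with a slash, which gives three slashes in total. The helper name &#039;&#039;to_file_url&#039;&#039; and the user name &#039;&#039;jdoe&#039;&#039; are hypothetical and only serve as an illustration.&lt;br /&gt;

```shell
# Hypothetical helper: turn an absolute local path into a file:// URL.
# "file://" + "/user/..." yields "file:///user/..." (three slashes).
to_file_url() {
    case "$1" in
        /*) printf 'file://%s\n' "$1" ;;  # absolute path: prefix it
        *)  return 1 ;;                   # relative path: refuse
    esac
}

to_file_url /user/jdoe/MyFile.root
```

A relative path is rejected on purpose, since prefixing it would produce a URL with the wrong number of slashes.&lt;br /&gt;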
&lt;br /&gt;
&lt;br /&gt;
*To get a list of all distant urls for all the Storage Elements, one can do:&lt;br /&gt;
::&amp;lt;pre&amp;gt; lcg-infosites --vo cms se &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Examples ===&lt;br /&gt;
*To list the contents of a directory &#039;&#039;/pnfs/iihe/cms&#039;&#039; :&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-ls srm://maite.iihe.ac.be:8443/pnfs/iihe/cms &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To create a directory:&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-mkdir srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/$USER/NewDir &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To copy a file from local disk to a remote server:&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-copy file:///user/$USER/MyFile.root srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/$USER/ &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To copy a file from remote server to our Storage Element:&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-copy srm://srm-eoscms.cern.ch:8443/srm/v2/server?SFN=/eos/cms/store/group/comm_trigger/L1TrackTrigger/BE5D_620_SLHC6/singleMu/NoPU/reDIGI_SLHC6-TrackTrigger_muon_pgun-0499.root srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/odevroed/eosTransfer.root &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To delete a file on a remote server:&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-rm srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/$USER/MyFile.root &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To remove a directory and its entire content on a remote server (this may not be working for now):&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-rm -r srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/$USER/NewDir &amp;lt;/pre&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=== Bulk file transfers ===&lt;br /&gt;
&lt;br /&gt;
There is an elegant way to run gfal-copy through several files. This is done using the &#039;&#039;&#039;--from-file&#039;&#039;&#039; option.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Syntax:&lt;br /&gt;
&amp;lt;pre&amp;gt; gfal-copy -f --from-file files.txt file://$PWD &amp;lt;/pre&amp;gt;&lt;br /&gt;
where files.txt is a text file in which every line is a source in SRM URL syntax.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Test with a single line in files.txt first, and make sure the SRM URLs are correct for both source and destination before running over several files.&lt;br /&gt;
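&lt;br /&gt;
One possible way to produce the list of sources is sketched below. The function name &#039;&#039;make_srm_list&#039;&#039;, the user name &#039;&#039;jdoe&#039;&#039; and the file names are hypothetical; only the server address and path layout come from the examples on this page.&lt;br /&gt;

```shell
# Hypothetical helper: print one SRM source URL per file name.
# $1 is the remote base directory, the remaining arguments are file names.
make_srm_list() {
    base="$1"
    shift
    for name in "$@"; do
        printf '%s/%s\n' "$base" "$name"
    done
}

make_srm_list srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/jdoe a.root b.root
```

Redirecting this output to files.txt gives a file that can be passed to the --from-file option. Inspect the printed URLs before starting the transfer.&lt;br /&gt;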
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Copy directories from and to pnfs within the IIHE ===&lt;br /&gt;
A script to copy full directories to and from pnfs exists on the slc6 UI&#039;s:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
copyDirectoryPnfs.py&lt;br /&gt;
Move all files in a directory to or from pnfs&lt;br /&gt;
This script assumes that you copy within the IIHE&lt;br /&gt;
The script does not do recursive copying&lt;br /&gt;
Make sure you have a valid proxy, made with voms-proxy-init --voms cms:/cms/becms&lt;br /&gt;
&lt;br /&gt;
Mandatory options:&lt;br /&gt;
--in=                 : directory to copy from&lt;br /&gt;
--out=                : directory to copy to&lt;br /&gt;
Both directories need to be complete (i.e. including the /pnfs or /user part&lt;br /&gt;
&lt;br /&gt;
example:&lt;br /&gt;
copyDirectoryPnfs.py --out=/user/odevroed/newfile --in=/pnfs/iihe/cms/store/user/odevroed/newdir&lt;br /&gt;
&lt;br /&gt;
Optional:&lt;br /&gt;
-h, --help             : print this help message&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== dCache ==&lt;br /&gt;
Direct dCache access to files is only possible if the software supports it.&lt;br /&gt;
PNFS (the directory structure seen under /pnfs/) is NOT a regular filesystem: it is &#039;&#039;&#039;immutable&#039;&#039;&#039; and mounted read-only.&lt;br /&gt;
&lt;br /&gt;
*Therefore, here is a list of the commands that work:&lt;br /&gt;
 &#039;&#039;&#039;ls&#039;&#039;&#039;&lt;br /&gt;
* And replacement commands:&lt;br /&gt;
** &#039;&#039;&#039;dccp&#039;&#039;&#039;:  to copy files from /pnfs to local disk. Example:&lt;br /&gt;
 dccp dcap://maite.iihe.ac.be/pnfs/iihe&lt;br /&gt;
&lt;br /&gt;
* To open files using root, use e.g.:&lt;br /&gt;
&amp;lt;pre&amp;gt; root dcap://maite.iihe.ac.be/pnfs/iihe/some/file.root &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If reading root files is slow or fails altogether even though nothing is wrong with the file (e.g. in an interactive analysis on beo or msa), you can increase your dCache read-ahead buffer. Do not make the buffer larger than 50 MB! To enlarge the buffer, set this in your environment for csh:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
setenv DCACHE_RAHEAD 1&lt;br /&gt;
setenv DCACHE_RA_BUFFER 50000000&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
For bash:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
export DCACHE_RAHEAD=true&lt;br /&gt;
export DCACHE_RA_BUFFER=50000000&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
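&lt;br /&gt;
As a minimal sketch for bash or sh, the 50 MB limit can be enforced when setting the variables. The function name &#039;&#039;set_dcache_buffer&#039;&#039; is hypothetical; the variable names and the limit are the ones given above.&lt;br /&gt;

```shell
# Hypothetical helper: enable read-ahead and cap the buffer at 50 MB.
set_dcache_buffer() {
    max=50000000
    size="$1"
    # Clamp requests above the recommended maximum
    if [ "$size" -gt "$max" ]; then
        size="$max"
    fi
    export DCACHE_RAHEAD=true
    export DCACHE_RA_BUFFER="$size"
}

set_dcache_buffer 80000000
echo "$DCACHE_RA_BUFFER"
```

Here the request for 80000000 bytes is reduced to 50000000, while smaller values are kept as given.&lt;br /&gt;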
See the dCache [http://www.dcache.org/ website] for further reading.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=GridStorageAccess&amp;diff=702</id>
		<title>GridStorageAccess</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=GridStorageAccess&amp;diff=702"/>
		<updated>2016-07-13T15:22:10Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
This page describes how to handle data stored on grid storage.&lt;br /&gt;
&lt;br /&gt;
== Before starting ==&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|40x30px|line=1|]] Before being able to run these commands you first need to make a valid proxy [[File:Exclamation-mark.jpg|40x30px|line=1|]] &amp;lt;br&amp;gt;&lt;br /&gt;
::: &amp;lt;pre&amp;gt;voms-proxy-init --voms cms&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== GFAL ==&lt;br /&gt;
GFAL is a wrapper around the latest grid commands. Once you know it, you do not need to learn new commands (such as srm, lcg, etc.) whenever the underlying middleware changes.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
=== gfal-commands ===&lt;br /&gt;
If you want more information on the options that can be used, please use &#039;&#039;&#039;man gfal-command&#039;&#039;&#039; !&lt;br /&gt;
&lt;br /&gt;
Here are all the commands that can be used:&lt;br /&gt;
*&#039;&#039;gfal-ls&#039;&#039;: get information on a file, or list the contents of a directory&lt;br /&gt;
*&#039;&#039;gfal-mkdir&#039;&#039;: create a directory&lt;br /&gt;
*&#039;&#039;gfal-rm&#039;&#039;: removes a file. To remove an entire directory, use -r&lt;br /&gt;
*&#039;&#039;gfal-copy&#039;&#039;: copy files.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
=== Usage ===&lt;br /&gt;
There are 2 types of file url:&lt;br /&gt;
* &#039;&#039;&#039;Distant files&#039;&#039;&#039;: their url is of the type srm://&amp;lt;name_of_server&amp;gt;:&amp;lt;port&amp;gt;/some/path, eg for IIHE:&lt;br /&gt;
 srm://maite.iihe.ac.be:8443/pnfs/iihe/&lt;br /&gt;
* &#039;&#039;&#039;Local files&#039;&#039;&#039;: their url is of the type file://path_of_the_file, eg for IIHE:&lt;br /&gt;
 file:///user/$USER/MyFile.root&lt;br /&gt;
&lt;br /&gt;
[[File:Exclamation-mark.jpg|left|40x30px|line=1|]] Be careful, the number of &#039;&#039;&#039;/&#039;&#039;&#039; is very -very- important [[File:Exclamation-mark.jpg|40x30px|line=1|]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*To get a list of all distant urls for all the Storage Elements, one can do:&lt;br /&gt;
::&amp;lt;pre&amp;gt; lcg-infosites --vo cms se &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Examples ===&lt;br /&gt;
*To list the contents of a directory &#039;&#039;/pnfs/iihe/cms&#039;&#039; :&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-ls srm://maite.iihe.ac.be:8443/pnfs/iihe/cms &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To create a directory:&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-mkdir srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/$USER/NewDir &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To copy a file from local disk to a remote server:&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-copy file:///user/$USER/MyFile.root srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/$USER/ &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To copy a file from remote server to our Storage Element:&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-copy srm://srm-eoscms.cern.ch:8443/srm/v2/server?SFN=/eos/cms/store/group/comm_trigger/L1TrackTrigger/BE5D_620_SLHC6/singleMu/NoPU/reDIGI_SLHC6-TrackTrigger_muon_pgun-0499.root srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/odevroed/eosTransfer.root &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To delete a file on a remote server:&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-rm srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/$USER/MyFile.root &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* To remove a directory and its entire content on a remote server (this may not be working for now):&lt;br /&gt;
::&amp;lt;pre&amp;gt; gfal-rm -r srm://maite.iihe.ac.be:8443/pnfs/iihe/cms/store/user/$USER/NewDir &amp;lt;/pre&amp;gt;   &lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
=== Bulk file transfers ===&lt;br /&gt;
&lt;br /&gt;
There is an elegant way to run gfal-copy through several files. This is done using the &#039;&#039;&#039;--from-file&#039;&#039;&#039; option.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Syntax:&lt;br /&gt;
&amp;lt;pre&amp;gt; gfal-copy -f --from-file files.txt file://$PWD &amp;lt;/pre&amp;gt;&lt;br /&gt;
where files.txt is a text file in which every line is a source in SRM URL syntax.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Test with a single line in files.txt first, and make sure the SRM URLs are correct for both source and destination before running over several files.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Copy directories from and to pnfs within the IIHE ===&lt;br /&gt;
A script to copy full directories to and from pnfs exists on the slc6 UI&#039;s:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
copyDirectoryPnfs.py&lt;br /&gt;
Move all files in a directory to or from pnfs&lt;br /&gt;
This script assumes that you copy within the IIHE&lt;br /&gt;
The script does not do recursive copying&lt;br /&gt;
Make sure you have a valid proxy, made with voms-proxy-init --voms cms:/cms/becms&lt;br /&gt;
&lt;br /&gt;
Mandatory options:&lt;br /&gt;
--in=                 : directory to copy from&lt;br /&gt;
--out=                : directory to copy to&lt;br /&gt;
Both directories need to be complete (i.e. including the /pnfs or /user part&lt;br /&gt;
&lt;br /&gt;
example:&lt;br /&gt;
copyDirectoryPnfs.py --out=/user/odevroed/newfile --in=/pnfs/iihe/cms/store/user/odevroed/newdir&lt;br /&gt;
&lt;br /&gt;
Optional:&lt;br /&gt;
-h, --help             : print this help message&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== dCache ==&lt;br /&gt;
Direct dCache access to files is only possible if the software supports it.&lt;br /&gt;
PNFS (the directory structure seen under /pnfs/) is NOT a regular filesystem: it is &#039;&#039;&#039;immutable&#039;&#039;&#039; and mounted read-only.&lt;br /&gt;
&lt;br /&gt;
*Therefore, here is a list of the commands that work:&lt;br /&gt;
 &#039;&#039;&#039;ls&#039;&#039;&#039;&lt;br /&gt;
* And replacement commands:&lt;br /&gt;
** &#039;&#039;&#039;dccp&#039;&#039;&#039;:  to copy files from /pnfs to local disk. Example:&lt;br /&gt;
 dccp dcap://maite.iihe.ac.be/pnfs/iihe&lt;br /&gt;
&lt;br /&gt;
* To open files using root, use e.g.:&lt;br /&gt;
&amp;lt;pre&amp;gt; root dcap://maite.iihe.ac.be/pnfs/iihe/some/file.root &amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
If reading root files is slow or fails altogether even though nothing is wrong with the file (e.g. in an interactive analysis on beo or msa), you can increase your dCache read-ahead buffer. Do not make the buffer larger than 50 MB! To enlarge the buffer, set this in your environment for csh:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
setenv DCACHE_RAHEAD 1&lt;br /&gt;
setenv DCACHE_RA_BUFFER 50000000&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
For bash:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
export DCACHE_RAHEAD=true&lt;br /&gt;
export DCACHE_RA_BUFFER=50000000&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
See the dCache [http://www.dcache.org/ website] for further reading.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=File:Exclamation-mark.jpg&amp;diff=701</id>
		<title>File:Exclamation-mark.jpg</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=File:Exclamation-mark.jpg&amp;diff=701"/>
		<updated>2016-07-13T13:14:00Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=698</id>
		<title>Getting a certificate for the T2</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=698"/>
		<updated>2016-06-22T15:53:07Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* First time certificate */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;If you need grid access on the T2, please follow all the steps below: &amp;lt;br&amp;gt; &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== First time certificate ===&lt;br /&gt;
# [[Obtaining_a_certificate | Get a certificate (new)]]. &#039;&#039;If you already have a CERN grid certificate, you can temporarily use it. But for accounting reasons, we need you to get a Belgian certificate &#039;&#039;&lt;br /&gt;
# Extract your certificate from your browser: &#039;&#039;&#039;[[Chrome]]&#039;&#039;&#039; | &#039;&#039;&#039;[[Firefox]]&#039;&#039;&#039;&lt;br /&gt;
# [[certificate_to_UI | Put your certificate on the UIs]]&lt;br /&gt;
# Register to the VO&lt;br /&gt;
#:* CMS&lt;br /&gt;
#:*# [[Register_to_the_CMS_VO|Register to the CMS VO]]&lt;br /&gt;
#:*# [[SiteDB | Check if your certificate is ok on SiteDB]]. Note the DN.&lt;br /&gt;
#:*# [[CERN_certificate_management | Check that your certificate is the only one registered on the CERN website.]]&lt;br /&gt;
#:* IceCube&lt;br /&gt;
#:** [[Register_to_the_IceCube_VO|Register to the IceCube VO]]&lt;br /&gt;
# Send a mail to the T2B support (grid_adminATlistserv.vub.ac.be) with your DN in order to have write access on the T2.&lt;br /&gt;
# [[Check_Certificate_UIs | Check if everything works fine on the mX machines]]&lt;br /&gt;
&lt;br /&gt;
=== Certificate renewal ===&lt;br /&gt;
Since we changed certificate provider, you need to follow the &#039;First time certificate&#039; steps above to obtain a new certificate. This will remain the case until at least December 2016.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=695</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=695"/>
		<updated>2016-06-10T08:48:42Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Queues */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end. To use the cluster, you need to log into those machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of the values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful of what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take note of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but if you need to share/produce in a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs, it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPU&#039;s (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3064&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1536&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |30 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set the mail address (use in conjunction with -m); it MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -l nodes=1:ppn=2&lt;br /&gt;
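&lt;br /&gt;
The options above can be combined on one command line. The following is a minimal Python sketch that assembles such a line; the helper name build_qsub_command, the queue name localgrid used in the call, and the job name are illustrative assumptions, not part of the cluster tooling.&lt;br /&gt;

```python
import shlex

# Mail domains accepted by the batch system, as stated in the option list.
ALLOWED_MAIL_DOMAINS = ("@ulb.ac.be", "@vub.ac.be")


def build_qsub_command(script, queue, name=None, mail=None, mail_events="", cores=1):
    """Return the qsub argument list for the options described above.

    Hypothetical helper: it only assembles arguments, it does not submit.
    """
    cmd = ["qsub", "-q", queue]  # -q is mandatory
    if name:
        cmd += ["-N", name]
    if mail:
        if not mail.endswith(ALLOWED_MAIL_DOMAINS):
            raise ValueError("mail address must be @ulb.ac.be or @vub.ac.be")
        cmd += ["-M", mail]
        if mail_events:  # any combination of a, b, e
            cmd += ["-m", mail_events]
    if cores > 1:
        cmd += ["-l", f"nodes=1:ppn={cores}"]
    cmd.append(script)
    return cmd


cmd = build_qsub_command("myjob.sh", queue="localgrid", name="test", cores=2)
print(shlex.join(cmd))
# prints: qsub -q localgrid -N test -l nodes=1:ppn=2 myjob.sh
```

Building the command as a list rather than a string keeps it safe to hand to subprocess.run without a shell.&lt;br /&gt;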
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how the jobs are distributed over the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script that analyses the logs and prints per-user statistics is available:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
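&lt;br /&gt;
If you want to post-process the per-job lines yourself, they are easy to parse. The sketch below is an illustration only: the function name parse_job_line is hypothetical, and computing the efficiency as CPU time divided by walltime is an assumption about how the script defines it.&lt;br /&gt;

```python
import re

# Pattern for the per-job lines shown above. Groups, in order:
# job id, exit code, memory (MB), CPU time (s), walltime (s), job name.
JOB_LINE = re.compile(
    r"ID:\s*(\d+)\s+ExCode:\s*(-?\d+)\s+Mem:\s*(\d+)M\s+"
    r"cpuT:\s*(\d+)s\s+wallT:\s*(\d+)s\s+eff:\s*[\d.]+%\s+(\S+)"
)


def parse_job_line(line):
    """Return a dict for one per-job line, or None if the line does not match."""
    match = JOB_LINE.search(line)
    if match is None:
        return None
    job_id, exit_code, mem_mb, cpu_s, wall_s, name = match.groups()
    cpu_s, wall_s = int(cpu_s), int(wall_s)
    # Assumed definition: efficiency = CPU time / walltime, in percent.
    efficiency = 100.0 * cpu_s / wall_s if wall_s else 0.0
    return {
        "id": int(job_id),
        "exit_code": int(exit_code),
        "mem_mb": int(mem_mb),
        "cpu_s": cpu_s,
        "wall_s": wall_s,
        "efficiency": efficiency,
        "name": name,
    }


sample = "ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN"
print(parse_job_line(sample))
```

Separator and summary lines do not match the pattern, so the function returns None for them and they can simply be skipped.&lt;br /&gt;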
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
Users cannot access the backups themselves. Just send a mail to grid_admin@listserv.vub.iihe.ac.be specifying the file(s)/folder(s) you want restored, and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our zfs server allows us to do regular snapshots:&lt;br /&gt;
* One snapshot is taken every day at midnight; we keep the last seven of those (a complete week).&lt;br /&gt;
* We also take one snapshot per week, and keep the last 4 of them (a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main fileserver goes down.&lt;br /&gt;
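&lt;br /&gt;
To work out which date to ask for in a restore request, the retention scheme above can be sketched as follows. This is an illustration under stated assumptions: the function names are hypothetical, the snapshot taken at midnight today is counted as available, and the weekday of the weekly snapshot (Sunday here) is a guess, not a documented setting.&lt;br /&gt;

```python
from datetime import date, timedelta


def daily_snapshot_dates(today, keep=7):
    """Dates of the daily (midnight) snapshots still kept on `today`."""
    return [today - timedelta(days=n) for n in range(keep)]


def weekly_snapshot_dates(today, keep=4, weekday=6):
    """Dates of the weekly snapshots still kept on `today`.

    weekday=6 (Sunday) is an assumption; adjust it to the real schedule.
    """
    # Step back to the most recent occurrence of `weekday` (possibly today).
    last = today - timedelta(days=(today.weekday() - weekday) % 7)
    return [last - timedelta(weeks=n) for n in range(keep)]


today = date(2016, 11, 4)
print("daily :", [d.isoformat() for d in daily_snapshot_dates(today)])
print("weekly:", [d.isoformat() for d in weekly_snapshot_dates(today)])
```

The oldest date in the two lists is the oldest state of a file that can still be requested from the snapshots.&lt;br /&gt;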
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Monitoring&amp;diff=694</id>
		<title>Monitoring</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Monitoring&amp;diff=694"/>
		<updated>2016-05-20T18:11:53Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|  &#039;&#039;&#039;Nagios old style&#039;&#039;&#039;&lt;br /&gt;
|  &#039;&#039;&#039;Nagios Check_mk style&#039;&#039;&#039;&lt;br /&gt;
|  &#039;&#039;&#039;Nagios History&#039;&#039;&#039;&lt;br /&gt;
|  &#039;&#039;&#039;JSON&#039;&#039;&#039;  &lt;br /&gt;
|  &#039;&#039;&#039;Python&#039;&#039;&#039; &lt;br /&gt;
|-&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/nagios/cgi-bin/status.cgi?host=cream02.iihe.ac.be cream02]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=cream02.iihe.ac.be cream02]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/nagios/cgi-bin/notifications.cgi?host=cream02.iihe.ac.be cream02]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=cream02.iihe.ac.be&amp;amp;output_format=json cream02]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=cream02.iihe.ac.be&amp;amp;output_format=python cream02]&lt;br /&gt;
|-&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/nagios/cgi-bin/status.cgi?host=maite.iihe.ac.be maite]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=maite.iihe.ac.be maite]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/nagios/cgi-bin/notifications.cgi?host=maite.iihe.ac.be maite]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=maite.iihe.ac.be&amp;amp;output_format=json maite]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=maite.iihe.ac.be&amp;amp;output_format=python maite]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
Links:&lt;br /&gt;
&lt;br /&gt;
*Pilot Monitoring (CMS):&lt;br /&gt;
** &#039;&#039;&#039;Pilot factory&#039;&#039;&#039; http://submit-3.t2.ucsd.edu/CSstoragePath/aperez/T2s/multicore_occupancy_t2s_168h.html&lt;br /&gt;
** &#039;&#039;&#039;Current Pilot info&#039;&#039;&#039; http://submit-3.t2.ucsd.edu/CSstoragePath/aperez/HTML/site_status/status_mcore_T2_BE_IIHE.html&lt;br /&gt;
** &#039;&#039;&#039;Factory+Pressure summary&#039;&#039;&#039; http://cms-gwmsmon.cern.ch/totalview/T2_BE_IIHE&lt;br /&gt;
** &#039;&#039;&#039;Schedds + Pressure + Exclusive Pressure&#039;&#039;&#039; http://submit-3.t2.ucsd.edu/CSstoragePath/Monitor/latest-new.txt&lt;br /&gt;
&lt;br /&gt;
*VUB-ULB GRID:&lt;br /&gt;
**Ganglia http://mon.iihe.ac.be/ganglia&lt;br /&gt;
&lt;br /&gt;
*BEgrid:&lt;br /&gt;
**Ganglia https://ganglia.begrid.be/ganglia/&lt;br /&gt;
**GridIce https://gridice.begrid.be/gridice&lt;br /&gt;
&lt;br /&gt;
*LCG&lt;br /&gt;
**GocDB/Gstat http://goc.grid.sinica.edu.tw/gstat/&lt;br /&gt;
**GridIce http://gridice2.cnaf.infn.it:50080/gridice/site/site.php&lt;br /&gt;
**Admin SFTs https://monitoring.egee.man.poznan.pl/admin2&lt;br /&gt;
**GGUS Ticket Service https://gus.fzk.de/pages/home.php&lt;br /&gt;
**Accounting Info http://www2.egee.cesga.es/gridsite/accounting/CESGA/tree_egee.php?Path=1.7&lt;br /&gt;
**Availability/Reliability http://gridview.cern.ch/GRIDVIEW/same_index.php&lt;br /&gt;
**Modded version (only displays UCL and ULB-VUB): http://mon.iihe.ac.be/modded_gridview/same_index.php&lt;br /&gt;
**Basic plots&lt;br /&gt;
***Select &amp;lt;tt&amp;gt;Tier-2 Site Availability&amp;lt;/tt&amp;gt;&lt;br /&gt;
***Select both sites&lt;br /&gt;
***Select e.g. daily plots for one month&lt;br /&gt;
&lt;br /&gt;
*Belnet Network Monitor http://monitor.belnet.be/graph/grapherrd.php (this page only works from within the BELNET client network)&lt;br /&gt;
**the Inbound/Outbound directions are given from BELNET&#039;s point of view (i.e. Outbound BELNET = Incoming university traffic)&lt;br /&gt;
**the &amp;quot;Free traffic&amp;quot; is Research traffic; the red line is a limit for commercial traffic.&lt;br /&gt;
**UCL (whole campus) http://monitor.belnet.be/graph/grapherrd.php?page=lln.cfg&amp;amp;target=ucl&lt;br /&gt;
**ULB-VUB (whole campus) http://monitor.belnet.be/graph/grapherrd.php?page=brussels.cfg&amp;amp;target=ulb-vub&lt;br /&gt;
&lt;br /&gt;
*CMS&lt;br /&gt;
**Dashboard http://arda-dashboard.cern.ch/cms/&lt;br /&gt;
**Monalisa http://monalisa.cacr.caltech.edu:9090/&lt;br /&gt;
**CRAB http://cmsgridweb.pg.infn.it/crab/crabmon.php&lt;br /&gt;
**Phedex Heartbeat http://cmsdoc.cern.ch/cms/aprom/TransferHeartbeat/browser&lt;br /&gt;
**Phedex on Dashboard http://pcardabg.cern.ch:8080/dashboard/phedex/&lt;br /&gt;
**Installed software https://twiki.cern.ch/twiki/bin/view/CMS/CMSSWInstStatus&lt;br /&gt;
**SC4&lt;br /&gt;
**SC4 Site Status https://twiki.cern.ch/twiki/bin/view/CMS/SWIntSC4SiteStatus&lt;br /&gt;
**CSA06&lt;br /&gt;
**Wiki: https://uimon.cern.ch/twiki/bin/view/CMS/CSA06&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{TracNotice|{{PAGENAME}}}}&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Register_to_the_IceCube_VO&amp;diff=693</id>
		<title>Register to the IceCube VO</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Register_to_the_IceCube_VO&amp;diff=693"/>
		<updated>2016-05-20T17:25:20Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* Register with your certificate here :&lt;br /&gt;
&lt;br /&gt;
https://grid-voms.desy.de:8443/voms/icecube&lt;br /&gt;
&lt;br /&gt;
* Test on m machines :&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
voms-proxy-init --voms icecube&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Monitoring&amp;diff=684</id>
		<title>Monitoring</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Monitoring&amp;diff=684"/>
		<updated>2016-05-18T17:24:18Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|  &#039;&#039;&#039;Nagios old style&#039;&#039;&#039;&lt;br /&gt;
|  &#039;&#039;&#039;Nagios Check_mk style&#039;&#039;&#039;&lt;br /&gt;
|  &#039;&#039;&#039;Nagios History&#039;&#039;&#039;&lt;br /&gt;
|  &#039;&#039;&#039;JSON&#039;&#039;&#039;  &lt;br /&gt;
|  &#039;&#039;&#039;Python&#039;&#039;&#039; &lt;br /&gt;
|-&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/nagios/cgi-bin/status.cgi?host=cream02.iihe.ac.be cream02]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=cream02.iihe.ac.be cream02]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/nagios/cgi-bin/notifications.cgi?host=cream02.iihe.ac.be cream02]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=cream02.iihe.ac.be&amp;amp;output_format=json cream02]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=cream02.iihe.ac.be&amp;amp;output_format=python cream02]&lt;br /&gt;
|-&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/nagios/cgi-bin/status.cgi?host=maite.iihe.ac.be maite]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=maite.iihe.ac.be maite]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/nagios/cgi-bin/notifications.cgi?host=maite.iihe.ac.be maite]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=maite.iihe.ac.be&amp;amp;output_format=json maite]&lt;br /&gt;
|  [https://etf-cms-prod.cern.ch/etf/check_mk/view.py?view_name=host&amp;amp;host=maite.iihe.ac.be&amp;amp;output_format=python maite]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
Links:&lt;br /&gt;
&lt;br /&gt;
*Pilot Monitoring (CMS):&lt;br /&gt;
** http://submit-3.t2.ucsd.edu/CSstoragePath/aperez/T2s/multicore_occupancy_t2s_168h.html&lt;br /&gt;
** http://submit-3.t2.ucsd.edu/CSstoragePath/aperez/HTML/site_status/status_mcore_T2_BE_IIHE.html&lt;br /&gt;
** http://cms-gwmsmon.cern.ch/totalview/T2_BE_IIHE&lt;br /&gt;
&lt;br /&gt;
*VUB-ULB GRID:&lt;br /&gt;
**Ganglia http://mon.iihe.ac.be/ganglia&lt;br /&gt;
&lt;br /&gt;
*BEgrid:&lt;br /&gt;
**Ganglia https://ganglia.begrid.be/ganglia/&lt;br /&gt;
**GridIce https://gridice.begrid.be/gridice&lt;br /&gt;
&lt;br /&gt;
*LCG&lt;br /&gt;
**GocDB/Gstat http://goc.grid.sinica.edu.tw/gstat/&lt;br /&gt;
**GridIce http://gridice2.cnaf.infn.it:50080/gridice/site/site.php&lt;br /&gt;
**Admin SFTs https://monitoring.egee.man.poznan.pl/admin2&lt;br /&gt;
**GGUS Ticket Service https://gus.fzk.de/pages/home.php&lt;br /&gt;
**Accounting Info http://www2.egee.cesga.es/gridsite/accounting/CESGA/tree_egee.php?Path=1.7&lt;br /&gt;
**Availability/Reliability http://gridview.cern.ch/GRIDVIEW/same_index.php&lt;br /&gt;
**Modded version (only displays UCL and ULB-VUB): http://mon.iihe.ac.be/modded_gridview/same_index.php&lt;br /&gt;
**Basic plots&lt;br /&gt;
***Select &amp;lt;tt&amp;gt;Tier-2 Site Availability&amp;lt;/tt&amp;gt;&lt;br /&gt;
***Select both sites&lt;br /&gt;
***Select e.g. daily plots for one month&lt;br /&gt;
&lt;br /&gt;
*Belnet Network Monitor http://monitor.belnet.be/graph/grapherrd.php (this page only works from within the BELNET client network)&lt;br /&gt;
**the Inbound/Outbound directions are given from BELNET&#039;s point of view (i.e. Outbound BELNET = Incoming university traffic)&lt;br /&gt;
**the &amp;quot;Free traffic&amp;quot; is Research traffic; the red line is a limit for commercial traffic.&lt;br /&gt;
**UCL (whole campus) http://monitor.belnet.be/graph/grapherrd.php?page=lln.cfg&amp;amp;target=ucl&lt;br /&gt;
**ULB-VUB (whole campus) http://monitor.belnet.be/graph/grapherrd.php?page=brussels.cfg&amp;amp;target=ulb-vub&lt;br /&gt;
&lt;br /&gt;
*CMS&lt;br /&gt;
**Dashboard http://arda-dashboard.cern.ch/cms/&lt;br /&gt;
**Monalisa http://monalisa.cacr.caltech.edu:9090/&lt;br /&gt;
**CRAB http://cmsgridweb.pg.infn.it/crab/crabmon.php&lt;br /&gt;
**Phedex Heartbeat http://cmsdoc.cern.ch/cms/aprom/TransferHeartbeat/browser&lt;br /&gt;
**Phedex on Dashboard http://pcardabg.cern.ch:8080/dashboard/phedex/&lt;br /&gt;
**Installed software https://twiki.cern.ch/twiki/bin/view/CMS/CMSSWInstStatus&lt;br /&gt;
**SC4&lt;br /&gt;
**SC4 Site Status https://twiki.cern.ch/twiki/bin/view/CMS/SWIntSC4SiteStatus&lt;br /&gt;
**CSA06&lt;br /&gt;
**Wiki: https://uimon.cern.ch/twiki/bin/view/CMS/CSA06&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{TracNotice|{{PAGENAME}}}}&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Network_bond_and_tag&amp;diff=668</id>
		<title>Network bond and tag</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Network_bond_and_tag&amp;diff=668"/>
		<updated>2016-04-21T12:38:30Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* On the switch */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=== On the host ===&lt;br /&gt;
&lt;br /&gt;
In &#039;&#039;&#039;/etc/sysconfig/network-scripts&#039;&#039;&#039;:&lt;br /&gt;
:* ifcfg-bond0&lt;br /&gt;
&amp;lt;pre&amp;gt;DEVICE=&amp;quot;bond0&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
IPADDR=192.168.10.135&lt;br /&gt;
BROADCAST=192.168.255.255&lt;br /&gt;
NETMASK=255.255.0.0&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
BONDING_MASTER=yes&lt;br /&gt;
TYPE=Bond&lt;br /&gt;
BONDING_OPTS=&amp;quot;mode=4 miimon=100 lacp_rate=0&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To get a second bond interface tagged on VLAN 2 (PUB):&lt;br /&gt;
 vconfig add bond0 2&lt;br /&gt;
&lt;br /&gt;
:* ifcfg-bond0.2&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
DEVICE=&amp;quot;bond0.2&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
ONPARENT=yes&lt;br /&gt;
IPADDR=193.58.172.92&lt;br /&gt;
BROADCAST=&amp;quot;193.58.172.127&amp;quot;&lt;br /&gt;
NETMASK=255.255.255.128&lt;br /&gt;
GATEWAY=193.58.172.2&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
VLAN=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Aggregate interfaces eth0 and eth1 to the bond:&lt;br /&gt;
** ifcfg-eth0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
DEVICE=&amp;quot;eth0&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
MASTER=bond0&lt;br /&gt;
SLAVE=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
:* ifcfg-eth1&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
DEVICE=&amp;quot;eth1&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
MASTER=bond0&lt;br /&gt;
SLAVE=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
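&lt;br /&gt;
Before restarting the network, it can help to check that the IPADDR, NETMASK, BROADCAST and GATEWAY values in these files agree with each other. The following is a minimal sketch using the Python standard library; the function name check_ifcfg is hypothetical and the check is not part of the procedure itself.&lt;br /&gt;

```python
import ipaddress


def check_ifcfg(ipaddr, netmask, broadcast, gateway=None):
    """Return a list of inconsistencies between ifcfg address settings."""
    iface = ipaddress.ip_interface(f"{ipaddr}/{netmask}")
    net = iface.network
    problems = []
    # The broadcast address must be the last address of the subnet.
    if str(net.broadcast_address) != broadcast:
        problems.append(f"BROADCAST should be {net.broadcast_address}, not {broadcast}")
    # The gateway, when given, must sit inside the same subnet.
    if gateway is not None and ipaddress.ip_address(gateway) not in net:
        problems.append(f"GATEWAY {gateway} is outside {net}")
    return problems


# Values copied from ifcfg-bond0 and ifcfg-bond0.2 above.
print(check_ifcfg("192.168.10.135", "255.255.0.0", "192.168.255.255"))
print(check_ifcfg("193.58.172.92", "255.255.255.128", "193.58.172.127", "193.58.172.2"))
```

An empty list means the values are consistent; both calls above return an empty list for the settings shown on this page.&lt;br /&gt;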
&lt;br /&gt;
=== On the switch ===&lt;br /&gt;
Everything is configured in LACP [&amp;lt;=&amp;gt; mode=4 in BONDING_OPTS for bond0]&lt;br /&gt;
* Adding the PoX interface on the 10G switch&lt;br /&gt;
** Using the interface, first create the LACP port: in the tab &#039;&#039;&#039;ETH Mgmt&#039;&#039;&#039;, click on &#039;&#039;&#039;Link Aggregation&#039;&#039;&#039; and enter PoX (X being a number) in the box &#039;&#039;&#039;LAG Interface Number&#039;&#039;&#039;.&lt;br /&gt;
** In the tab &#039;&#039;&#039;Ports&#039;&#039;&#039;, click on the port you want to add, then select &#039;&#039;&#039;LAG&#039;&#039;&#039;: PoX and &#039;&#039;&#039;LAG mode&#039;&#039;&#039;: active (&#039;&#039;&#039;LACP rate&#039;&#039;&#039; is automatically set to slow [&amp;lt;=&amp;gt; lacp_rate=0 in BONDING_OPTS for bond0] )&lt;br /&gt;
::: &#039;&#039;If it does not work and complains about vlans, follow the section &#039;&#039;&#039;Adding a new port to PoX&#039;&#039;&#039; below&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* on the switch in CLI:&lt;br /&gt;
** the PoX port is going to be in hybrid mode, to have untagged[=access] vlan1 and tagged[=hybrid allowed-vlan] vlan 2 : &#039;&#039;&#039;[ vlan1 cannot be tagged on the switch ]&#039;&#039;&#039;&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
cd somewhere ?&lt;br /&gt;
switchport mode hybrid&lt;br /&gt;
switchport access vlan 1&lt;br /&gt;
switchport hybrid allowed-vlan 2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
:* Adding a new port to PoX&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
cd port&lt;br /&gt;
switchport mode hybrid&lt;br /&gt;
switchport access vlan 1&lt;br /&gt;
switchport hybrid allowed-vlan 2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
then on the interface, add the port to PoX following the previous section.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Network_bond_and_tag&amp;diff=667</id>
		<title>Network bond and tag</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Network_bond_and_tag&amp;diff=667"/>
		<updated>2016-04-21T12:37:24Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* On the switch */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=== On the host ===&lt;br /&gt;
&lt;br /&gt;
In &#039;&#039;&#039;/etc/sysconfig/network-scripts&#039;&#039;&#039;:&lt;br /&gt;
:* ifcfg-bond0&lt;br /&gt;
&amp;lt;pre&amp;gt;DEVICE=&amp;quot;bond0&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
IPADDR=192.168.10.135&lt;br /&gt;
BROADCAST=192.168.255.255&lt;br /&gt;
NETMASK=255.255.0.0&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
BONDING_MASTER=yes&lt;br /&gt;
TYPE=Bond&lt;br /&gt;
BONDING_OPTS=&amp;quot;mode=4 miimon=100 lacp_rate=0&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To get a second bond interface tagged on VLAN 2 (PUB):&lt;br /&gt;
 vconfig add bond0 2&lt;br /&gt;
&lt;br /&gt;
:* ifcfg-bond0.2&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
DEVICE=&amp;quot;bond0.2&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
ONPARENT=yes&lt;br /&gt;
IPADDR=193.58.172.92&lt;br /&gt;
BROADCAST=&amp;quot;193.58.172.127&amp;quot;&lt;br /&gt;
NETMASK=255.255.255.128&lt;br /&gt;
GATEWAY=193.58.172.2&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
VLAN=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Aggregate interfaces eth0 and eth1 to the bond:&lt;br /&gt;
** ifcfg-eth0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
DEVICE=&amp;quot;eth0&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
MASTER=bond0&lt;br /&gt;
SLAVE=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
:* ifcfg-eth1&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
DEVICE=&amp;quot;eth1&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
MASTER=bond0&lt;br /&gt;
SLAVE=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== On the switch ===&lt;br /&gt;
Everything is configured in LACP [&amp;lt;=&amp;gt; mode=4 in BONDING_OPTS for bond0]&lt;br /&gt;
* Adding the PoX interface on the 10G switch&lt;br /&gt;
** Using the interface, first create the LACP port: in the tab &#039;&#039;&#039;ETH Mgmt&#039;&#039;&#039;, click on &#039;&#039;&#039;Link Aggregation&#039;&#039;&#039; and enter PoX (X being a number) in the box &#039;&#039;&#039;LAG Interface Number&#039;&#039;&#039;.&lt;br /&gt;
** In the tab &#039;&#039;&#039;Ports&#039;&#039;&#039;, click on the port you want to add, then select &#039;&#039;&#039;LAG&#039;&#039;&#039;: PoX and &#039;&#039;&#039;LAG mode&#039;&#039;&#039;: active (&#039;&#039;&#039;LACP rate&#039;&#039;&#039; is automatically set to slow [&amp;lt;=&amp;gt; lacp_rate=0 in BONDING_OPTS for bond0] )&lt;br /&gt;
::: &#039;&#039;If it does not work and complains about vlans, follow the section &#039;&#039;&#039;Adding a new port to PoX&#039;&#039;&#039; below&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* on the switch in CLI:&lt;br /&gt;
** the PoX port is going to be in hybrid mode, to have untagged[=access] vlan1 and tagged[=hybrid allowed-vlan] vlan 2 : &#039;&#039;&#039;[ vlan1 cannot be tagged on the switch ]&#039;&#039;&#039;&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
cd somewhere ?&lt;br /&gt;
switchport mode hybrid&lt;br /&gt;
switchport access vlan 1&lt;br /&gt;
switchport hybrid allowed-vlan 2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
:* Adding a new port to PoX&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
cd port&lt;br /&gt;
switchport mode hybrid&lt;br /&gt;
switchport access vlan 1&lt;br /&gt;
switchport hybrid allowed-vlan 2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
then on the interface, add the port to PoX following the previous section.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Network_bond_and_tag&amp;diff=666</id>
		<title>Network bond and tag</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Network_bond_and_tag&amp;diff=666"/>
		<updated>2016-04-21T12:18:15Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: Created page with &amp;quot;=== On the host ===  In &amp;#039;&amp;#039;&amp;#039;/etc/sysconfig/network-scripts&amp;#039;&amp;#039;&amp;#039;: :* ifcfg-bond0 &amp;lt;pre&amp;gt;DEVICE=&amp;quot;bond0&amp;quot; BOOTPROTO=none IPADDR=192.168.10.135 BROADCAST=192.168.255.255 NETMASK=255.255...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=== On the host ===&lt;br /&gt;
&lt;br /&gt;
In &#039;&#039;&#039;/etc/sysconfig/network-scripts&#039;&#039;&#039;:&lt;br /&gt;
:* ifcfg-bond0&lt;br /&gt;
&amp;lt;pre&amp;gt;DEVICE=&amp;quot;bond0&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
IPADDR=192.168.10.135&lt;br /&gt;
BROADCAST=192.168.255.255&lt;br /&gt;
NETMASK=255.255.0.0&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
BONDING_MASTER=yes&lt;br /&gt;
TYPE=Bond&lt;br /&gt;
BONDING_OPTS=&amp;quot;mode=4 miimon=100 lacp_rate=0&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To get a second bond interface tagged on VLAN 2 (PUB):&lt;br /&gt;
 vconfig add bond0 2&lt;br /&gt;
&lt;br /&gt;
:* ifcfg-bond0.2&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
DEVICE=&amp;quot;bond0.2&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
ONPARENT=yes&lt;br /&gt;
IPADDR=193.58.172.92&lt;br /&gt;
BROADCAST=&amp;quot;193.58.172.127&amp;quot;&lt;br /&gt;
NETMASK=255.255.255.128&lt;br /&gt;
GATEWAY=193.58.172.2&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
VLAN=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
* Aggregate interfaces eth0 and eth1 to the bond:&lt;br /&gt;
** ifcfg-eth0&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
DEVICE=&amp;quot;eth0&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
MASTER=bond0&lt;br /&gt;
SLAVE=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
:* ifcfg-eth1&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
DEVICE=&amp;quot;eth1&amp;quot;&lt;br /&gt;
BOOTPROTO=none&lt;br /&gt;
NM_CONTROLLED=&amp;quot;no&amp;quot;&lt;br /&gt;
ONBOOT=&amp;quot;yes&amp;quot;&lt;br /&gt;
USERCTL=no&lt;br /&gt;
MASTER=bond0&lt;br /&gt;
SLAVE=yes&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== On the switch ===&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=AdminPage&amp;diff=665</id>
		<title>AdminPage</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=AdminPage&amp;diff=665"/>
		<updated>2016-04-21T12:10:21Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{DISPLAYTITLE:Page for Administrators}}&lt;br /&gt;
==== Management of the whole cluster ====&lt;br /&gt;
*[[elog]]&lt;br /&gt;
*[[ShutDownCluster| How to properly switch off the cluster]]&lt;br /&gt;
*[[PutClusterOn| How to properly put the cluster on]]&lt;br /&gt;
==== CMS Services ====&lt;br /&gt;
*[[Phedex]]&lt;br /&gt;
*[[Heartbeat]]&lt;br /&gt;
*[[LoadTest]]&lt;br /&gt;
*[[FroNTier]]&lt;br /&gt;
*[[ProdAgent]]&lt;br /&gt;
*[[GitForSiteConf| instructions to commit siteconf to git]]&lt;br /&gt;
==== Grid Configuration Issues ====&lt;br /&gt;
*[[UpdateCertificates| Update the certificates of all our machines]]&lt;br /&gt;
*[[CreamIssues| Issues with cream and how to solve them]]&lt;br /&gt;
*[[PBS_TMPDIR| PBS TMPDIR]]&lt;br /&gt;
*[[APEL| &amp;lt;strike&amp;gt;APEL&amp;lt;/strike&amp;gt;(OBSOLETE)]]&lt;br /&gt;
*[[BDII]]&lt;br /&gt;
*[[FTS]]&lt;br /&gt;
*[[SL4_x86_64_WNs| &amp;lt;strike&amp;gt;SL4 x86_64 WNs&amp;lt;/strike&amp;gt;(OBSOLETE)]]&lt;br /&gt;
*[[CE_oveloaded| CE overloaded]]&lt;br /&gt;
*[[RB]]&lt;br /&gt;
*[[IPMI]]&lt;br /&gt;
*[[CA_certificates| &amp;lt;strike&amp;gt;Upgrade CA certificates&amp;lt;/strike&amp;gt; (OBSOLETE)]]&lt;br /&gt;
*[[Shutdown| Shutting down the cluster]]&lt;br /&gt;
*[[Software_Area_Switch| Software Area Switch]]&lt;br /&gt;
*[[KernelUpdate| Kernel mandatory updates for critical vulnerabilities]]&lt;br /&gt;
*[[Argus| Argus server and glexec on the workernodes]]&lt;br /&gt;
*[[ApelGapPublishing| Apel gap publishing]]&lt;br /&gt;
*[[UpdateCACertificates| Update IGTF CA certificates]]&lt;br /&gt;
&lt;br /&gt;
==== Files section ====&lt;br /&gt;
*[[DCache| dCache]]&lt;br /&gt;
**[[DeleteObsoleteFiles| Find obsolete files]]&lt;br /&gt;
**[[DCacheAdminMode| dCache admin mode]]&lt;br /&gt;
**[[FindSizeOnDcache| find size of a directory on dcache]]&lt;br /&gt;
**[[DcachePoolConfig1912| dCache Pool Postinstallation steps]]&lt;br /&gt;
**[[DCacheMaxMovers| Adapt max mover]]&lt;br /&gt;
**[[pnfsScripts| scripts to see on what pools files in a directory reside and to move them to other pools]]&lt;br /&gt;
*[[OlPnfsFiles| Procedure for removal of old user files on pnfs]]&lt;br /&gt;
*[[GetLostFiles| Retrieve lost files from datasets]]&lt;br /&gt;
*[[StorageConsistency| Storage Consistency]]&lt;br /&gt;
&lt;br /&gt;
==== Status and Monitoring ====&lt;br /&gt;
*[[ReservedWNs| List of reserved WNs]]&lt;br /&gt;
*[[Todo| Todo-list]]&lt;br /&gt;
*[[Monitoring]]&lt;br /&gt;
*[[Plans-Schedule| Plans/Schedule]]&lt;br /&gt;
*[[Grid_Troubleshooting_link| Grid Troubleshooting link]]&lt;br /&gt;
*[[Incident_reports| Incident Reports]]&lt;br /&gt;
*[[Dissapeared_software| How to put the software back]]&lt;br /&gt;
*[[Bad_WN| What to do when a WN sends a &amp;quot;bad_wn.pl&amp;quot; email to grid_admin ?]]&lt;br /&gt;
*[[Nagios_installation| Nagios Installation at IIHE]]&lt;br /&gt;
*[[Restart_DCache| How to restart DCache ]]&lt;br /&gt;
&lt;br /&gt;
==== Info ====&lt;br /&gt;
*[[General_info| General info]]&lt;br /&gt;
*[[Installing_CMSSW| Installing CMSSW]]&lt;br /&gt;
*[[Installing_CRAB| Installing CRAB]]&lt;br /&gt;
*[[System_Benchmarks| System Benchmarks]]&lt;br /&gt;
*[[T2BTrac| T2B Trac config info]]&lt;br /&gt;
*[[HardWare| Hardware information]]&lt;br /&gt;
*[[NetworkSetup| Network Setup]]&lt;br /&gt;
*[[SetupMonitoringControlerSunfireV20z| Setup Monitoring of LSI Disk Controller on Sunfire V20z Server]]&lt;br /&gt;
*[[LDAP_UCL_IIHE| &amp;lt;strike&amp;gt;LDAP authentication system for the replication between UCL and IIHE sites&amp;lt;/strike&amp;gt; (OBSOLETE)]]&lt;br /&gt;
*[[GridAdminSurvivalGuide| IIHE Grid-admin survival guide]]&lt;br /&gt;
*[[Solaris| Solaris 10]]&lt;br /&gt;
*[[SolarisSSD| Adding an SSD card and configuring RAID, zpools, filesystems and shares on the new Solaris fileserver]]&lt;br /&gt;
*[[LinuxAdminTricks| Linux tricks for admins]]&lt;br /&gt;
*[[CrabLocalPbsSubmission| &amp;lt;strike&amp;gt;How to implement local PBS submission with CRAB ?&amp;lt;/strike&amp;gt;(OBSOLETE)]]&lt;br /&gt;
*[[AddNewUserFromUCLToLDAP| &amp;lt;strike&amp;gt;How to create an account for a CMS user from UCL ?&amp;lt;/strike&amp;gt;(OBSOLETE)]]&lt;br /&gt;
*[[OSErrata| &amp;lt;strike&amp;gt;Deploying OS errata&amp;lt;/strike&amp;gt;(OBSOLETE)]]&lt;br /&gt;
*[[BenchmarkHEPSPEC06| How to benchmark a node with HEPSPEC06]]&lt;br /&gt;
*[[Installing_dcache_pool| Install a new dCache pool]]&lt;br /&gt;
*[[BackupUsersHomeDirs| Backup of the users home dirs on Jefke]]&lt;br /&gt;
*[[MonWebServicesMigration| Migration of mon and its Web services]]&lt;br /&gt;
*[[HOWTORestartNagiosTest| HOWTO restart a nagios test manually]]&lt;br /&gt;
*[[CompileAndInstallRoot| Compile and install ROOT]]&lt;br /&gt;
*[[CleanCreamdb| Clean creamdb]]&lt;br /&gt;
*Reboot campaign for the workernodes :&lt;br /&gt;
**[[KernelUpdate| Reboot after a kernel update]]&lt;br /&gt;
**[[UpgradeWNstoSL5.5| Reboot after an OS upgrade]]&lt;br /&gt;
*[[ManageAllAdminScriptsWithGit| Central management of all the admin scripts with Git]]&lt;br /&gt;
*[[ConfigProxyCvmfs| Configuration of a proxy for CVMFS]]&lt;br /&gt;
**[[RecoverCvmfs| How to recover CVMFS]]&lt;br /&gt;
*[[TestNFSPerformance| How to test NFS Performance]]&lt;br /&gt;
*[[TetexNotAvailableInSL6| Alternatives to Tetex]]&lt;br /&gt;
*[[NewMethodUpdateKernelWorkernodes| A new easy method to update kernel on the workernodes]]&lt;br /&gt;
*[[AutomaticMailSendingFromCluster| About automatic mail sending from the cluster]]&lt;br /&gt;
*[[T2BTracAccess| T2B Trac access configuration]]&lt;br /&gt;
*[[WorkingWithRHEL7| Surviving RHEL7]]&lt;br /&gt;
*[[CCMWithKerberos| Experimental : Securing profiles with Kerberos]]&lt;br /&gt;
*[[MigrateToMediaWiki| Migration of T2B Wiki from Trac to MediaWiki]]&lt;br /&gt;
*[[motd|Message Of The Day (motd)]]&lt;br /&gt;
&lt;br /&gt;
==== Quattor ====&lt;br /&gt;
*[http://quattor.begrid.be/trac/centralised-begrid-v5/wiki/BEgridAndQuattor &amp;lt;strike&amp;gt;BEgrid wiki&amp;lt;/strike&amp;gt;(OBSOLETE)]&lt;br /&gt;
*[[Test_things| &amp;lt;strike&amp;gt;Test things&amp;lt;/strike&amp;gt;(OBSOLETE)]]&lt;br /&gt;
*[[Lemon_installation| &amp;lt;strike&amp;gt;Lemon installation&amp;lt;/strike&amp;gt;(OBSOLETE)]]&lt;br /&gt;
*[[QuattorPointers| &amp;lt;strike&amp;gt;Pointers&amp;lt;/strike&amp;gt;]]&amp;lt;strike&amp;gt; to more in-depth information on quattor&amp;lt;/strike&amp;gt;(OBSOLETE)&lt;br /&gt;
*[[AddingMachineToCluster| &amp;lt;strike&amp;gt;Adding&amp;lt;/strike&amp;gt;]]&amp;lt;strike&amp;gt; a new machine to the cluster&amp;lt;/strike&amp;gt;(OBSOLETE)&lt;br /&gt;
*[[AutomaticMachineTemplateGeneration| Automatic generation of hardware and profile templates for new workernodes]]&lt;br /&gt;
*[[InstallationBEgridClient0| Installation of a Quattor deployment server release 13.1]]&lt;br /&gt;
*[[InstallFilesNewOS| How to add a new OS to the Quattor Repository]]&lt;br /&gt;
*[[GenerateRPMFromATagInGithub| How to build an RPM from a tag in Github]]&lt;br /&gt;
*[[HowtoMigrateWNToCB9| How to migrate workernodes from CB8 to CB9]]&lt;br /&gt;
*[[WorkingInCB9| Working in CB9 (Quattor release &amp;gt;= 14.2)]]&lt;br /&gt;
*[[AideMemoire| FAQ - Aide-mémoire - Howtos]]&lt;br /&gt;
*[[BuildANewPysvnOnAiiServer| Howto build a new pysvn on a SL63 AII server]]&lt;br /&gt;
*[[QuattorFreeIPA| Quattor and FreeIPA]]&lt;br /&gt;
*[[NewRuncheck| Rewrite of the runcheck script in Perl]]&lt;br /&gt;
*[[HardDisksManagement| Hard disks management]]&lt;br /&gt;
*[[Aquilon| Aquilon]]&lt;br /&gt;
&lt;br /&gt;
==== KVM virtualization ====&lt;br /&gt;
*[[VirtWithKVM| Virtualization of the new CREAM-CE on dom02 with KVM]]&lt;br /&gt;
*[[VirtWithKVM1| Installation of the new virtualization server dom04]]&lt;br /&gt;
*[[CreateVM| Easy creation of virtual machines]]&lt;br /&gt;
*[[MonitoringvHostswithGanglia| Monitoring the KVM vHosts with Ganglia]]&lt;br /&gt;
&lt;br /&gt;
==== T2B Cloud ====&lt;br /&gt;
*[[MigrationToOpenNebula| Transforming the KVM hypervisors farm into an OpenNebula cloud]]&lt;br /&gt;
*[[WorkingInT2BCloud| Working in the T2B cloud]]&lt;br /&gt;
&lt;br /&gt;
==== gUSE/WS-PGRADE portal ====&lt;br /&gt;
*[[PortalInstall| Portal installation]]&lt;br /&gt;
*[[PortalConfig| Portal configuration]]&lt;br /&gt;
*[[PortalOperations| Portal operations]]&lt;br /&gt;
&lt;br /&gt;
==== Migration to EMI-3 ====&lt;br /&gt;
*[[MigrateBEgridToEMI3_part1| BEgrid facilities - Part 1]]&lt;br /&gt;
*[[MigrateBEgridToEMI3_part2| BEgrid facilities - Part 2]]&lt;br /&gt;
&lt;br /&gt;
==== XEN ====&lt;br /&gt;
*[[Manage_XEN| Manage XEN]]&lt;br /&gt;
*[[XenQuattor| Xen and Quattor]]&lt;br /&gt;
&lt;br /&gt;
==== CEPH ====&lt;br /&gt;
*[[UnderstandingCeph| Understanding Ceph]]&lt;br /&gt;
*[[InstallCephWithQuattor| Installing Ceph with Quattor]]&lt;br /&gt;
*[[ExperimentsWithCeph| Experiments with Ceph]]&lt;br /&gt;
*[[CephBasics| Operating a Ceph cluster]]&lt;br /&gt;
&lt;br /&gt;
==== Logstash / Elasticsearch / Kibana (ELK) ====&lt;br /&gt;
machine: log10 | [http://log10.iihe.ac.be/index.html interface]  |  [http://log10.iihe.ac.be/HQ index manager]&lt;br /&gt;
* [[log_forwarding_with_quattor|Forwarding a log with rsyslog to logstash using quattor]]&lt;br /&gt;
* [[log_parsing_with_logstash|Parsing the logs with logstash]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Network ====&lt;br /&gt;
* [[network_bond_and_tag|Bonding of 2 interfaces + tagging of 2 VLANs on the bond (PRIV+PUB)]]&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Main_Page&amp;diff=663</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Main_Page&amp;diff=663"/>
		<updated>2016-04-06T15:19:27Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* General information for users */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Welcome to the CMS Belgian T2 Wiki ==&lt;br /&gt;
&amp;lt;center&amp;gt;&amp;lt;span style=&amp;quot;font-size: 300%;&amp;quot;&amp;gt; [[first_access_to_t2b|=&amp;gt; FIRST ACCESS TO T2B &amp;lt;=]] &amp;lt;/span&amp;gt;&amp;lt;/center&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== General information for users ===&lt;br /&gt;
*[[Faq_t2b | FAQ]]&lt;br /&gt;
*[[ Cluster_Presentation | Cluster Presentation ]]&lt;br /&gt;
==== Information for new users ====&lt;br /&gt;
*[[T2bSupport| T2B Support guidelines]]&lt;br /&gt;
*[[First_access_to_t2b|Getting access to T2B]]&lt;br /&gt;
*[[Getting_a_certificate_for_the_T2|Certificates and VOs]]&lt;br /&gt;
&lt;br /&gt;
==== Using the Tier2 computing resources ====&lt;br /&gt;
*[[policies| Policies concerning the usage of computing resources.]]&lt;br /&gt;
*[[CurrentStatus| Current status of the Tier2]]&lt;br /&gt;
*[[Getting_started_with_the_CMSSW_software| Getting started with the CMSSW software]]&lt;br /&gt;
*[[Using_Git| Using Git]]&lt;br /&gt;
*[[Getting_started_with_the_MadGraph_software| Getting started with the MadGraph software]]&lt;br /&gt;
*Submitting jobs with CRAB&lt;br /&gt;
**[[gridSubmission_withCrab| To the worldwide grid]]&lt;br /&gt;
*Submitting jobs without CRAB&lt;br /&gt;
**[[localSubmission| To the local resources]]&lt;br /&gt;
**[[gridSubmission| To the worldwide grid]]&lt;br /&gt;
*[[GridStorageAccess| How to handle data on Grid storage]]&lt;br /&gt;
*[[FAQ_CMSSW_on_the_Grid| FAQ CMSSW on the Grid on proxy and more!]]&lt;br /&gt;
*[[OtherSoftware| Other software available at the T2]]&lt;br /&gt;
&lt;br /&gt;
==== Back-up procedures ====&lt;br /&gt;
*[[BackupDocsLinuxLaptop| Procedure to automate backups of personal documents (Linux laptops only)]]&lt;br /&gt;
*[[Backup| Backup scheme for the user disks]]&lt;br /&gt;
*[[AccidentalDeleteOfFiles| What to do if I have accidentally deleted some files on my personal computer ?]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==== Other topics ====&lt;br /&gt;
*[[User_Meetings]] list of user meetings with added transparencies&lt;br /&gt;
*[[Basic_computing_skills| Basic computing skills]]&lt;br /&gt;
*[[CernLxplus| Useful info on use of lxplus.cern.ch]]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Dedicated twiki pages maintained by several research groups ===&lt;br /&gt;
*[[TtBar_Analysis_Framework| TtBar Analysis Framework (old)]]&lt;br /&gt;
*[[TopQuarkGroup| Top Quark Group wiki]]&lt;br /&gt;
*[[HEEP_Analysis_Framework| HEEP Analysis Framework]]&lt;br /&gt;
*[[V0_Analysis_wiki| V0 Analysis wiki]]&lt;br /&gt;
*[[Info_exchange| Higgs analysis]]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Service work for CMS ===&lt;br /&gt;
*[[DDT]]&lt;br /&gt;
*[[CMSSWDeployment]]&lt;br /&gt;
*[[ProdAgentAllUsers| Prodagent for users]]&lt;br /&gt;
*[[TestStoreTemp| Writing tests in /store/temp/user on T2 SE&#039;s]]&lt;br /&gt;
*[[CrabServerInstall| CRAB Server installation]]&lt;br /&gt;
*[[CrabValidation| Basic validation of CRAB releases]]&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
&lt;br /&gt;
=== Obsolete twiki pages ===&lt;br /&gt;
*[[DIY-UI]]&lt;br /&gt;
*[[CrabIIHETransitionSLC4ToSLC5| Using CRAB at IIHE during the transition from SLC4 to SLC5]]&lt;br /&gt;
*[[DataSamplesRequests2007| Samples Requests for 2007]]&lt;br /&gt;
*[[DataSamplesRequests2008| Samples Requests for 2008]]&lt;br /&gt;
*[[CrabNewIIHE| Crab at IIHE]]&lt;br /&gt;
*[[OldMainPage| The old main page is kept here for reference]]&lt;br /&gt;
&lt;br /&gt;
== Admin section ==&lt;br /&gt;
*[[AdminPage| Pages for administrators]]&lt;br /&gt;
&lt;br /&gt;
== Getting started with MediaWiki ==&lt;br /&gt;
*[//meta.wikimedia.org/wiki/Help:Contents User&#039;s Guide]&lt;br /&gt;
*[//www.mediawiki.org/wiki/Manual:Configuration_settings Configuration settings list]&lt;br /&gt;
*[//www.mediawiki.org/wiki/Manual:FAQ MediaWiki FAQ]&lt;br /&gt;
*[https://lists.wikimedia.org/mailman/listinfo/mediawiki-announce MediaWiki release mailing list]&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=LocalSubmission&amp;diff=662</id>
		<title>LocalSubmission</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=LocalSubmission&amp;diff=662"/>
		<updated>2016-04-06T12:20:26Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Procedure */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Direct submission to local queue on the T2_BE_IIHE cluster ==&lt;br /&gt;
&lt;br /&gt;
=== Aim ===&lt;br /&gt;
&lt;br /&gt;
*The aim of this page is to provide a brief introduction to submitting jobs to the local queue.&lt;br /&gt;
*The local queue allows you to send executable code to the Tier2 cluster.&lt;br /&gt;
*This procedure can be used to run non-CMSSW code that needs access to files on the Storage Element (SE) maite.iihe.ac.be.&lt;br /&gt;
*Using this procedure avoids overloading the User Interfaces (UIs), known as the mX machines.&lt;br /&gt;
&lt;br /&gt;
=== Procedure ===&lt;br /&gt;
&lt;br /&gt;
*Log in to a UI mX.iihe.ac.be; replace X with a number of your choice. See [[policies]] for the usage policies on the UIs.&lt;br /&gt;
*Make a directory and prepare an executable.&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
mkdir directsubmissiontest&lt;br /&gt;
cd directsubmissiontest/&lt;br /&gt;
emacs script.sh&amp;amp;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
*Paste the code of script.sh (given in the Attachments section below) into script.sh.&lt;br /&gt;
*Execute the following command to submit the script to the local queue:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qsub -q localgrid@cream02 -o script.stdout -e script.stderr script.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
*Follow the progress of your job on the UI&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qstat -u $USER localgrid@cream02&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*Your job is finished when it no longer appears in the qstat output. You should now be able to find your output files in the directory you created:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/user/$USER/directsubmissiontest/script.stdout&lt;br /&gt;
/user/$USER/directsubmissiontest/script.stderr&lt;br /&gt;
/user/$USER/directsubmissiontest/&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Comments and FAQ ===&lt;br /&gt;
*If you would like to access a ROOT file, you should first copy it to the /scratch space on the workernode.&lt;br /&gt;
**/scratch is the local disk of the workernode and is several hundred GB in size.&lt;br /&gt;
**Each job is allotted a working directory that is cleaned automatically at the end of the job. The path of this directory is stored in the variable $TMPDIR.&lt;br /&gt;
**Your procedure should look like this:&lt;br /&gt;
**Copy the necessary ROOT files from /localgrid (if you have any) to $TMPDIR.&lt;br /&gt;
**Make sure the output of the job is also written to $TMPDIR.&lt;br /&gt;
**Copy your output files back to /localgrid.&lt;br /&gt;
**Do not read ROOT files directly from /localgrid. This directory is not physically located on the workernode; it is mounted from the fileserver. Reading from it puts a heavy load on the fileserver, potentially slowing down the UIs.&lt;br /&gt;
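&lt;br /&gt;
The procedure above can be sketched as a small shell function. This is only an illustration under stated assumptions: the function name stage_and_run, its two directory arguments and the placeholder payload (recording the size of each input file in summary.txt) are hypothetical and not part of the T2B setup. On a workernode the batch system provides $TMPDIR; elsewhere the sketch falls back to /tmp.&lt;br /&gt;

```shell
#!/bin/bash
# Hypothetical helper: stage inputs into scratch, work there, copy results back.
# $1 = input directory, $2 = output directory (e.g. directories under /localgrid/$USER).
stage_and_run() {
    local input_dir="$1" output_dir="$2"
    local workdir
    # TMPDIR is set by the batch system on a workernode; /tmp is a fallback (assumption).
    workdir="$(mktemp -d "${TMPDIR:-/tmp}/job.XXXXXX")" || return 1

    # 1. Copy the input files to the local scratch area.
    cp "$input_dir"/*.root "$workdir"/ || return 1

    # 2. Run the payload inside scratch. Placeholder payload: record each file size.
    (
        cd "$workdir" || exit 1
        for f in *.root; do
            wc -c "$f" >> summary.txt
        done
    ) || return 1

    # 3. Copy the output back, then clean up the scratch directory.
    mkdir -p "$output_dir" || return 1
    cp "$workdir/summary.txt" "$output_dir"/ || return 1
    rm -rf "$workdir"
}
```

In a real job script you would call it with your own paths, for example stage_and_run /localgrid/$USER/input /localgrid/$USER/output, and replace the placeholder loop with your actual executable.&lt;br /&gt;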
&lt;br /&gt;
&#039;&#039;&#039;****** IMPORTANT *******&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
If you use local submission, please be aware that you can potentially slow down our site. So please copy all the files you will use during the job to /scratch to avoid this. &amp;lt;br&amp;gt;&lt;br /&gt;
Many thanks, &amp;lt;br&amp;gt;&lt;br /&gt;
The Admin Team&lt;br /&gt;
&lt;br /&gt;
*How to set CMSSW environment in a batch job?&lt;br /&gt;
&lt;br /&gt;
Add the following lines to your script :&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
pwd=$PWD&lt;br /&gt;
source $VO_CMS_SW_DIR/cmsset_default.sh                          # make scram available&lt;br /&gt;
cd /localgrid/&amp;lt;USER NAME&amp;gt;/path/to/CMSSW_4_1_4/src/               # your local CMSSW release&lt;br /&gt;
eval `scram runtime -sh`                                         # don&#039;t use cmsenv, won&#039;t work on batch&lt;br /&gt;
cd $pwd&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
*How to make your proxy available during batch jobs?&lt;br /&gt;
&lt;br /&gt;
Make sure you have a valid proxy and copy it to some place on /localgrid :&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
cp $X509_USER_PROXY /localgrid/&amp;lt;USER NAME&amp;gt;/some/place&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Add the following line to your script :&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
export X509_USER_PROXY=/localgrid/&amp;lt;USER NAME&amp;gt;/some/place&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
*How can I prevent my short jobs from being stuck in the waiting queue when the site is full?&lt;br /&gt;
&lt;br /&gt;
If you intend to submit short jobs, then it is wise to specify explicitly to the batch system their estimated maximum walltime. You can do this by adding an option to the qsub command :&lt;br /&gt;
 &amp;lt;pre&amp;gt;&lt;br /&gt;
 qsub -q localgrid@cream02 -o script.stdout -e script.stderr -l walltime=&amp;lt;HH:MM:SS&amp;gt; script.sh&lt;br /&gt;
 &amp;lt;/pre&amp;gt;&lt;br /&gt;
or by adding the following line at the beginning of your job script :&lt;br /&gt;
 &amp;lt;pre&amp;gt;&lt;br /&gt;
 #PBS -l walltime=&amp;lt;HH:MM:SS&amp;gt;&lt;br /&gt;
 &amp;lt;/pre&amp;gt;&lt;br /&gt;
Proceeding this way, your jobs&#039; priority will grow faster as time goes by, increasing their chances of being executed first. (The shorter they are, the faster their priority will increase over time.)&lt;br /&gt;
&lt;br /&gt;
But be aware that if your jobs run longer than the specified maximum walltime, they will be killed by the batch system. So don&#039;t hesitate to overestimate this maximum walltime a bit.&lt;br /&gt;
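&lt;br /&gt;
As a rough illustration of this advice, the following sketch turns an estimated run time in seconds into an HH:MM:SS walltime with some headroom. The function name walltime_with_margin and the 20% margin are assumptions made for this example, not site recommendations.&lt;br /&gt;

```shell
#!/bin/bash
# Hypothetical helper: print a walltime string for an estimated run time.
# $1 = estimated run time in seconds (integer).
walltime_with_margin() {
    local seconds="$1"
    # Add a 20% safety margin (an arbitrary choice for this sketch).
    local padded=$(( seconds + seconds / 5 ))
    printf '%02d:%02d:%02d\n' \
        $(( padded / 3600 )) $(( padded % 3600 / 60 )) $(( padded % 60 ))
}

# Possible use with the qsub command shown above:
# qsub -q localgrid@cream02 -l walltime="$(walltime_with_margin 3000)" script.sh
```

For an estimate of 3000 seconds the function prints 01:00:00, because 3000 plus 20% is 3600 seconds.&lt;br /&gt;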
&lt;br /&gt;
=== Stop your jobs ===&lt;br /&gt;
&lt;br /&gt;
If for some reason, you want to stop your jobs on the server, you can use this procedure:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qstat @cream02 | grep &amp;lt;your user name&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
This will give you a list of your jobs with their IDs, for instance:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
394402.cream02            submit.sh        odevroed               0 R localgrid&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
Now, use the ID to kill the job with the qdel command:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
qdel 394402.cream02&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Your job will now be removed.&lt;br /&gt;
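&lt;br /&gt;
The lookup step above can also be done by matching the user column exactly instead of using grep, which avoids catching jobs of other users whose job name happens to contain your user name. This is a minimal sketch, assuming the default qstat column layout shown above (job ID, job name, user, ...) and that qstat does not truncate your user name; the function name job_ids_for_user is hypothetical.&lt;br /&gt;

```shell
#!/bin/bash
# Hypothetical helper: read qstat-style lines on stdin and print the job IDs
# whose third column (the user) is exactly the name given as $1.
job_ids_for_user() {
    awk -v user="$1" '$3 == user { print $1 }'
}

# Possible use (xargs -r is the GNU option that skips qdel when the list is empty):
# qstat @cream02 | job_ids_for_user "$USER" | xargs -r qdel
```

Run the pipeline without the final xargs part first to check which job IDs would be deleted.&lt;br /&gt;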
&lt;br /&gt;
=== Attachments ===&lt;br /&gt;
&lt;br /&gt;
*script.sh&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#!/bin/bash          &lt;br /&gt;
&lt;br /&gt;
##Some general shell commands&lt;br /&gt;
STR=&amp;quot;Hello World!&amp;quot;&lt;br /&gt;
echo $STR    &lt;br /&gt;
echo &amp;quot;&amp;gt;&amp;gt; script.sh is checking where it is&amp;quot;&lt;br /&gt;
pwd&lt;br /&gt;
echo &amp;quot;&amp;gt;&amp;gt; script.sh is checking how much disk space is still available&amp;quot;&lt;br /&gt;
df -h&lt;br /&gt;
echo &amp;quot;&amp;gt;&amp;gt; script.sh is listing files and directories in the current location&amp;quot;&lt;br /&gt;
ls -l&lt;br /&gt;
echo &amp;quot;&amp;gt;&amp;gt; script.sh is listing files and directories in userdir on storage element&amp;quot;&lt;br /&gt;
ls -l /pnfs/iihe/cms/store/user/$USER&lt;br /&gt;
&lt;br /&gt;
##When accessing files on the storage element it is important to execute your code on the /scratch partition of the workernode you are running on. Therefore you need to copy your executable which is accessing/writing root files onto the /scratch partition and execute it there. This is illustrated below.&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;&amp;gt;&amp;gt; go to TMPDIR&amp;quot;&lt;br /&gt;
cd $TMPDIR&lt;br /&gt;
echo &amp;quot;&amp;gt;&amp;gt; ls of TMPDIR partition&amp;quot;&lt;br /&gt;
ls -l&lt;br /&gt;
&lt;br /&gt;
##Create a small root macro&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;{&lt;br /&gt;
  //TFile *MyFile = new TFile(\&amp;quot;testfile.root\&amp;quot;,\&amp;quot;RECREATE\&amp;quot;); &lt;br /&gt;
  //MyFile-&amp;gt;ls();&lt;br /&gt;
  //MyFile-&amp;gt;Close();&lt;br /&gt;
  TFile* f=TFile::Open(\&amp;quot;dcap://maite.iihe.ac.be/pnfs/iihe/cms/store/user/$USER/testfile.root\&amp;quot;);&lt;br /&gt;
  f-&amp;gt;ls();&lt;br /&gt;
  f-&amp;gt;Close();&lt;br /&gt;
} &lt;br /&gt;
&amp;quot; &amp;gt; rootScript.C&lt;br /&gt;
&lt;br /&gt;
cat rootScript.C&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;&amp;gt;&amp;gt; set root&amp;quot;&lt;br /&gt;
##Copied a root version from /user/cmssoft into /localgrid&lt;br /&gt;
export ROOTSYS=/localgrid/$USER/cmssoft/root_5.26.00e_iihe_default_dcap/root &lt;br /&gt;
export PATH=$PATH:$ROOTSYS/bin &lt;br /&gt;
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$ROOTSYS/lib&lt;br /&gt;
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:~/lib&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;&amp;gt;&amp;gt; execute root macro&amp;quot;&lt;br /&gt;
root -q -l -b -n rootScript.C&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;&amp;gt;&amp;gt; ls of TMPDIR&amp;quot;&lt;br /&gt;
ls -l&lt;br /&gt;
&lt;br /&gt;
echo &amp;quot;copy the file back to the /localgrid sandbox&amp;quot;&lt;br /&gt;
#cp testfile.root /localgrid/jmmaes/directsubmissiontest&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{TracNotice|{{PAGENAME}}}}&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=661</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=661"/>
		<updated>2016-04-06T11:58:24Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Batch System */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines:&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into these machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system: it manages all the submitted jobs and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The CPU time limit imposed per process. Our UIs are divided into 2 groups:&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of the values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful with what you do!&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take note of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user, but for when you need to share or produce files within a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPU&#039;s (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3064&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1536&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |20 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue you want [mandatory]&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set the mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resources options&lt;br /&gt;
:: For instance, if you want to use 2 cores: -lnodes=1:ppn=2&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show the distribution of jobs across the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script to analyze the logs and provide some statistics for the user is provided:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
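If you want to post-process the per-job lines yourself, the following is a minimal Python sketch. The field layout is inferred only from the sample output above, and the helper name parse_job_line and the assumption that the "M" suffix means megabytes are ours, not part of torque-user-info.py.

```python
import re

# Pattern for one per-job line, inferred from the sample output above.
# Assumption: Mem is reported in megabytes, cpuT and wallT in seconds.
JOB_LINE = re.compile(
    r"ID:\s*(\d+)\s+ExCode:\s*(-?\d+)\s+Mem:\s*(\d+)M\s+"
    r"cpuT:\s*(\d+)s\s+wallT:\s*(\d+)s\s+eff:\s*([\d.]+)%\s+(\S+)"
)


def parse_job_line(line):
    """Return a dict for one per-job line, or None if the line does not match."""
    match = JOB_LINE.search(line)
    if match is None:
        return None
    job_id, exit_code, mem, cpu, wall, eff, name = match.groups()
    return {
        "id": int(job_id),
        "exit_code": int(exit_code),
        "mem_mb": int(mem),
        "cpu_s": int(cpu),
        "wall_s": int(wall),
        "eff_percent": float(eff),
        "name": name,
    }


sample = "ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN"
job = parse_job_line(sample)
print(job["id"], job["mem_mb"], job["wall_s"], job["name"])
```

Lines that do not look like a per-job line (separators, the summary table) simply return None, so the function can be applied to every line of the output.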
&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]]&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
Users cannot access the backups directly. To have something restored, send a mail to grid_admin@listserv.vub.iihe.ac.be specifying the file(s)/folder(s) you want and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our zfs server allows us to do regular snapshots:&lt;br /&gt;
* One snapshot is taken every day at midnight, and we keep the last seven of those (so a complete week).&lt;br /&gt;
* We also have one snapshot per week, and keep the last 4 of them (so a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main fileserver goes down.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=660</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=660"/>
		<updated>2016-04-06T11:52:35Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Job management */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into those machines&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should look first for information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of them is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but if you need to share/produce in a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs, it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPU&#039;s (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3064&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1536&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |20 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue (default: any)&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
&lt;br /&gt;
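The options above can be combined on one command line. As a minimal sketch, the Python helper below assembles such a qsub call and only prints it, without submitting anything. The function name build_qsub_command and the example values (job name "test", resource string nodes=1:ppn=2) are illustrative assumptions; the "express" queue name comes from the queue table on this page.

```python
import shlex


def build_qsub_command(script, queue=None, job_name=None, mail_events=None, resources=None):
    """Assemble a qsub argument list from the options listed above."""
    cmd = ["qsub"]
    if queue:
        cmd += ["-q", queue]        # choose the queue
    if job_name:
        cmd += ["-N", job_name]     # name of the job
    if mail_events:
        cmd += ["-m", mail_events]  # any combination of a, b, e
    if resources:
        cmd += ["-l", resources]    # resource options
    cmd.append(script)
    return cmd


cmd = build_qsub_command(
    "myjob.sh", queue="express", job_name="test", resources="nodes=1:ppn=2"
)
# Print the command instead of running it, so it can be checked first.
print(shlex.join(cmd))
```

Keeping the command as a list makes it easy to hand over to subprocess.run later, once the printed line looks right.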
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how jobs are distributed across the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]]&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job Statistics ===&lt;br /&gt;
All the log files from the batch system are synced every 30 minutes in:&lt;br /&gt;
 /group/log/torque/&lt;br /&gt;
&lt;br /&gt;
A simple script is provided to analyze the logs and print some statistics for the user:&lt;br /&gt;
 /group/log/torque/torque-user-info.py&lt;br /&gt;
&lt;br /&gt;
Just execute it as is. It will print information like the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
ID: 6077555  ExCode:   0 Mem:    0M cpuT:      0s wallT:      3s eff:  0.0%   STDIN&lt;br /&gt;
ID: 6077602  ExCode:   0 Mem:   50M cpuT:      0s wallT:      2s eff:  0.0%   STDIN&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
   user[G]	# Jobs	 &amp;lt;MEM&amp;gt; +- RMS        #HiMem  MAX Mem      &amp;lt;CPU time&amp;gt;    &amp;lt;walltime&amp;gt;    &amp;lt;Eff&amp;gt;   % WT/WT_TOT    # Jobs with Error code (% of user job)&lt;br /&gt;
----------------------------------------------------------------------------------------------------------------------------------------------------------------&lt;br /&gt;
    rougny[l]	    12	    13 +- 22    MB      0      52 MB  |  00:00:00:00  00:00:00:24  ( 0.0%) (-1.00% of tot) | # EC: &lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
Users cannot access the backups directly. To have something restored, send a mail to grid_admin@listserv.vub.iihe.ac.be specifying the file(s)/folder(s) you want and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our zfs server allows us to do regular snapshots:&lt;br /&gt;
* One snapshot is taken every day at midnight, and we keep the last seven of those (so a complete week).&lt;br /&gt;
* We also have one snapshot per week, and keep the last 4 of them (so a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main fileserver goes down.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=658</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=658"/>
		<updated>2016-04-06T09:41:40Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Queues */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into those machines&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should look first for information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of them is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but if you need to share/produce in a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs, it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPU&#039;s (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3064&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1536&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |20 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
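To make the table concrete, here is a small Python sketch that picks a queue from a job's expected walltime and memory. The limits are copied from the table above, reading the memory figures as gigabytes; the function name pick_queue and the preference order (express first, then localgrid, then highmem) are our own assumptions, not a rule enforced by the batch system.

```python
# Limits copied from the queue table above:
# (walltime limit in minutes, memory limit in GB).
QUEUE_LIMITS = {
    "express": (20, 2),
    "localgrid": (95 * 60, 2),
    "highmem": (95 * 60, 4),
}

# Assumed preference order: the most restrictive queue that fits comes first.
PREFERENCE = ("express", "localgrid", "highmem")


def pick_queue(walltime_minutes, memory_gb):
    """Return the first queue whose limits cover the request, or None."""
    for name in PREFERENCE:
        max_walltime, max_memory = QUEUE_LIMITS[name]
        if max_walltime >= walltime_minutes and max_memory >= memory_gb:
            return name
    return None


print(pick_queue(10, 1))    # short and small job
print(pick_queue(600, 3))   # ten hours, needs more than 2 GB
print(pick_queue(6000, 2))  # 100 hours exceeds every walltime limit
```

A result of None means the job does not fit any queue as described here and should be split into shorter or smaller pieces.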
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue (default: any)&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how jobs are distributed across the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]]&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
Users cannot access the backups directly. To have something restored, send a mail to grid_admin@listserv.vub.iihe.ac.be specifying the file(s)/folder(s) you want and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our zfs server allows us to do regular snapshots:&lt;br /&gt;
* One snapshot is taken every day at midnight, and we keep the last seven of those (so a complete week).&lt;br /&gt;
* We also have one snapshot per week, and keep the last 4 of them (so a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main fileserver goes down.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=647</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=647"/>
		<updated>2016-02-26T13:05:24Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Queues */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into those machines&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should go first to find the information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of the values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but if you need to share/produce in a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs, it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
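A minimal sketch of how a job might use the scratch space described above: copy the input into a private directory under $TMPDIR, work there, then copy the result back to permanent storage.&lt;br /&gt;
The function name run_in_scratch, its arguments and the line-counting payload are hypothetical and only serve as an illustration; the fallback to /tmp is an assumption so that the sketch also runs outside a job.&lt;br /&gt;

```shell
#!/bin/sh
# Sketch: stage work through the per-job scratch area, then copy results back.
# run_in_scratch and its arguments are hypothetical names for illustration.
run_in_scratch() {
    input_file=$1
    output_dir=$2
    # On the worker nodes $TMPDIR points to the job scratch space;
    # fall back to /tmp so the sketch also runs elsewhere (assumption).
    work_dir=$(mktemp -d "${TMPDIR:-/tmp}/job.XXXXXX") || return 1
    cp "$input_file" "$work_dir/input" || return 1
    # Placeholder for the real payload: count the lines of the input.
    wc -l "$work_dir/input" | awk '{print $1}' > "$work_dir/result.txt"
    # Copy the result to permanent storage before the scratch is cleaned.
    mkdir -p "$output_dir" || return 1
    cp "$work_dir/result.txt" "$output_dir/" || return 1
    rm -rf "$work_dir"
}
```

Inside a job script this could be called as run_in_scratch followed by an input file and an output directory under /user/$USER. Anything left in the scratch space is lost once the job ends, so the copy back is the important step.&lt;br /&gt;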
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt; &amp;lt;span style=&#039;color:red&#039;&amp;gt;01/2016 : Not enabled yet&amp;lt;/span&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPU&#039;s (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3064&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 1536&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |20 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
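&lt;br /&gt;
As an illustration of how the walltime limits in the table can guide the choice of a queue, here is a small sketch. The helper name pick_queue is hypothetical; the thresholds are taken from the table (20 minutes for express, 95 hours = 5700 minutes for localgrid).&lt;br /&gt;

```shell
#!/bin/sh
# Sketch: choose a queue from the expected walltime in minutes.
# pick_queue is a hypothetical helper; the limits come from the queue table.
pick_queue() {
    minutes=$1
    if [ "$minutes" -le 20 ]; then
        echo express          # walltime limit: 20 minutes
    elif [ "$minutes" -le 5700 ]; then
        echo localgrid        # walltime limit: 95 hours = 5700 minutes
    else
        echo none             # no queue in the table allows this walltime
        return 1
    fi
}

pick_queue 15
pick_queue 600
```

Keep in mind that express also has much lower job limits (3 running jobs per user), so it only suits a handful of short test jobs.&lt;br /&gt;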
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue (default: any)&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
&lt;br /&gt;
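The options listed above can also be written inside the job script itself. The sketch below assumes a Torque/PBS-style batch system, where lines starting with #PBS are read as qsub options, -M sets the mail address, -m selects the mail events and -l walltime=HH:MM:SS requests a walltime.&lt;br /&gt;
The function name make_job_script, the job name demo and the address someone@vub.ac.be are hypothetical examples.&lt;br /&gt;

```shell
#!/bin/sh
# Sketch: print a job script whose #PBS lines mirror the qsub options above.
# Assumes a Torque/PBS-style batch system; make_job_script, the job name
# and the mail address are hypothetical examples.
make_job_script() {
    job_name=$1
    queue=$2
    walltime=$3      # HH:MM:SS
    mail=$4
    printf '%s\n' \
        '#!/bin/bash' \
        "#PBS -N $job_name" \
        "#PBS -q $queue" \
        "#PBS -l walltime=$walltime" \
        "#PBS -M $mail" \
        '#PBS -m ae' \
        'cd "$TMPDIR"' \
        'echo "running on $(hostname)"'
}

# Print an example script for a one-hour job in the default queue.
make_job_script demo localgrid 01:00:00 someone@vub.ac.be
```

To use it, redirect the output to a file such as myjob.sh and submit that file with qsub myjob.sh. Options given on the qsub command line normally take precedence over the #PBS lines.&lt;br /&gt;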
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show the distribution of jobs across the queues&lt;br /&gt;
&lt;br /&gt;
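To compare your usage with the per-user job limits, the qstat listing can be summarised with a short filter. This is only a sketch: it assumes the default Torque-style listing printed by a plain qstat (without -u), where the user is the 3rd column and the state letter (R, Q, ...) is the 5th. The function name count_jobs and the sample lines are hypothetical.&lt;br /&gt;

```shell
#!/bin/sh
# Sketch: count the jobs of one user in a given state from qstat-style
# output on stdin. Assumes the default Torque listing (plain qstat):
# column 3 = user, column 5 = state. count_jobs is a hypothetical name.
count_jobs() {
    awk -v user="$1" -v state="$2" \
        '$3 == user { if ($5 == state) n++ } END { print n + 0 }'
}

# Canned sample standing in for real qstat output; prints 2.
printf '%s\n' \
    '1001.cream02  ana1  alice  00:10:02  R  localgrid' \
    '1002.cream02  ana2  alice  0         Q  localgrid' \
    '1003.cream02  fit1  bob    00:03:11  R  localgrid' \
    '1004.cream02  ana3  alice  00:01:30  R  localgrid' |
    count_jobs alice R
```

On a UI the same filter could be fed by piping qstat into count_jobs with your username and the state R, provided the column layout matches the assumption above.&lt;br /&gt;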
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]]&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
There is no way for users to access the backups. Just send a mail to grid_admin@listserv.vub.ac.be specifying the file(s)/folder(s) you want, and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our zfs server allows us to do regular snapshots:&lt;br /&gt;
* One is taken every day at midnight; we keep the last seven of those (so a complete week).&lt;br /&gt;
* We also have one snapshot per week, and keep the last 4 of them (so a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main fileserver goes down.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=First_access_to_t2b&amp;diff=624</id>
		<title>First access to t2b</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=First_access_to_t2b&amp;diff=624"/>
		<updated>2016-02-03T15:12:03Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Next Steps */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;There are a few steps to gain access to the T2:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;First thing to know:&#039;&#039;&#039; our contact mail is &#039;&#039;&#039;&#039;&#039;grid_admin AT listserv.vub.ac.be&#039;&#039;&#039;&#039;&#039; . Please always send mails to this list and not to personal emails, this way everyone here can respond and keep up-to-date with problems.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
=== Getting access ===&lt;br /&gt;
* Now, to get access to our T2, send us a mail with the following :&lt;br /&gt;
*# So that we know who you are: put your promoter/boss in cc, and briefly introduce yourself (name, university, Physics group/experiment). Preferably send it from your preferred contact email, as it is the one we will keep to contact you.&lt;br /&gt;
*# Add to this email the login you want to use. If you have a CERN account, it needs to be the same (even if said account is not enabled anymore).&lt;br /&gt;
*# Finally, copy-paste the content of your &#039;&#039;&#039;PUBLIC&#039;&#039;&#039; ssh key in the mail, in order to have access to our User Interfaces (UIs, or often called the mX machines). To create a pair of ssh keys if you don&#039;t have one already, do the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
in a shell type:&lt;br /&gt;
&amp;gt;  ssh-keygen&lt;br /&gt;
and follow the instructions. The defaults suggested are fine. Just choose a passphrase.&lt;br /&gt;
&lt;br /&gt;
this will create 2 files in the following directory: $HOME/.ssh&lt;br /&gt;
The files are:&lt;br /&gt;
  id_rsa&lt;br /&gt;
  id_rsa.pub&lt;br /&gt;
&lt;br /&gt;
You need to send us the content of id_rsa.pub, which is your public key. The other is your private key and should never be shared.&lt;br /&gt;
Note that if you already have one somewhere else, copying both id_rsa[.pub] files on your computer is fine too.&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: 4. [OPTIONAL] If you already have a certificate, please add your DN to the mail. The DN is given by the content of &amp;quot;Identity&amp;quot; when issuing the command voms-proxy-info.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*You need to register to the t2b user mailing-list. This is where we make important announcements to our users.&lt;br /&gt;
*# Go to the [https://e-groups.cern.ch/e-groups/EgroupsManageOwnerAdmin.do request page]&lt;br /&gt;
*# search for the list beginning with &#039;&#039;&#039;belgian-t2-users&#039;&#039;&#039;&lt;br /&gt;
*# then hit the button subscribe&lt;br /&gt;
*# If any of this fails, ask someone to contact the mailinglist admins (or ask them to send a mail to this list with your request)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Next Steps ===&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* If you need a grid certificate, follow this [[Getting_a_certificate_for_the_T2|twiki page]]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* Now that you have access to our T2, read the following:&lt;br /&gt;
*#[[Cluster_Presentation| Presentation of our cluster]] : please have a careful look at this page !&lt;br /&gt;
*# [[LocalSubmission|Local job submission]]&lt;br /&gt;
*: ... everything else you might need on the [[Main_Page|wiki front page]]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=First_access_to_t2b&amp;diff=623</id>
		<title>First access to t2b</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=First_access_to_t2b&amp;diff=623"/>
		<updated>2016-02-03T15:11:54Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Next Steps */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;There are a few steps to gain access to the T2:&lt;br /&gt;
&lt;br /&gt;
* &#039;&#039;&#039;First thing to know:&#039;&#039;&#039; our contact mail is &#039;&#039;&#039;&#039;&#039;grid_admin AT listserv.vub.ac.be&#039;&#039;&#039;&#039;&#039; . Please always send mails to this list and not to personal emails, this way everyone here can respond and keep up-to-date with problems.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
=== Getting access ===&lt;br /&gt;
* Now, to get access to our T2, send us a mail with the following :&lt;br /&gt;
*# So that we know who you are: put your promoter/boss in cc, and briefly introduce yourself (name, university, Physics group/experiment). Preferably send it from your preferred contact email, as it is the one we will keep to contact you.&lt;br /&gt;
*# Add to this email the login you want to use. If you have a CERN account, it needs to be the same (even if said account is not enabled anymore).&lt;br /&gt;
*# Finally, copy-paste the content of your &#039;&#039;&#039;PUBLIC&#039;&#039;&#039; ssh key in the mail, in order to have access to our User Interfaces (UIs, or often called the mX machines). To create a pair of ssh keys if you don&#039;t have one already, do the following:&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
in a shell type:&lt;br /&gt;
&amp;gt;  ssh-keygen&lt;br /&gt;
and follow the instructions. The defaults suggested are fine. Just choose a passphrase.&lt;br /&gt;
&lt;br /&gt;
this will create 2 files in the following directory: $HOME/.ssh&lt;br /&gt;
The files are:&lt;br /&gt;
  id_rsa&lt;br /&gt;
  id_rsa.pub&lt;br /&gt;
&lt;br /&gt;
You need to send us the content of id_rsa.pub, which is your public key. The other is your private key and should never be shared.&lt;br /&gt;
Note that if you already have one somewhere else, copying both id_rsa[.pub] files on your computer is fine too.&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: 4. [OPTIONAL] If you already have a certificate, please add your DN to the mail. The DN is given by the content of &amp;quot;Identity&amp;quot; when issuing the command voms-proxy-info.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
*You need to register to the t2b user mailing-list. This is where we make important announcements to our users.&lt;br /&gt;
*# Go to the [https://e-groups.cern.ch/e-groups/EgroupsManageOwnerAdmin.do request page]&lt;br /&gt;
*# search for the list beginning with &#039;&#039;&#039;belgian-t2-users&#039;&#039;&#039;&lt;br /&gt;
*# then hit the button subscribe&lt;br /&gt;
*# If any of this fails, ask someone to contact the mailinglist admins (or ask them to send a mail to this list with your request)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Next Steps ===&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* If you need a grid certificate, follow this [[Getting_a_certificate_for_the_T2|twiki page]]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* Now that you have access to our T2, read the following:&lt;br /&gt;
*#[[Cluster_Presentation| Presentation of our cluster]] : please have a careful look at this page !&lt;br /&gt;
*# [[LocalSubmission|Local job submission]]&lt;br /&gt;
*: ... everything else you might need on the [[Main_Page|twiki front page]]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=ManageAllAdminScriptsWithGit&amp;diff=622</id>
		<title>ManageAllAdminScriptsWithGit</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=ManageAllAdminScriptsWithGit&amp;diff=622"/>
		<updated>2016-02-03T12:50:37Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Summary Of Git Commands */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__TOC__&lt;br /&gt;
&lt;br /&gt;
* The admin scripts are managed by [http://git.iihe.ac.be Git]&lt;br /&gt;
* They should be mounted on all machines managed by quattor in &#039;&#039;&#039;/scripts&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
=== Git Workflow ===&lt;br /&gt;
==== Adding/Modifying the scripts from your computer ====&lt;br /&gt;
# Request an account from git admins&lt;br /&gt;
# add your ssh key to your profile&lt;br /&gt;
# Do a &#039;&#039;&#039;git clone git@git.iihe.ac.be:iihe-scripts.git&#039;&#039;&#039; to download the content of iihe-scripts locally &amp;lt;br&amp;gt; If you already have the script directory locally, you can first make sure you are on the master branch with &#039;&#039;&#039;git checkout master&#039;&#039;&#039; , and then pull any updates with &#039;&#039;&#039;git pull&#039;&#039;&#039; &amp;lt;br&amp;gt; If you have made some changes to the master branch (which you shouldn&#039;t!!), use &#039;&#039;&#039;git stash&#039;&#039;&#039; to &#039;&#039;store&#039;&#039; the changes and get a vanilla master, and when on the new branch (next step) use &#039;&#039;&#039;git stash apply&#039;&#039;&#039; to apply the changes to the current branch.&lt;br /&gt;
# Make a new branch from master &#039;&#039;&#039;git checkout -b BranchImWorkingOn&#039;&#039;&#039;. The branch name should reflect what you plan on doing. Never work directly on the &#039;&#039;master&#039;&#039; branch !! &amp;lt;br&amp;gt; You can check this worked with &#039;&#039;&#039;git branch -ra&#039;&#039;&#039; that lists local &amp;amp; remote branches.&lt;br /&gt;
# Make your new scripts or modifications&lt;br /&gt;
# If you made new files/directories, prepare them for atomic commit: &#039;&#039;&#039;git add -N myfile&#039;&#039;&#039;&lt;br /&gt;
# Select the chunks of code you want to commit: &#039;&#039;&#039;git add -p&#039;&#039;&#039; , (explanations of letters [[ManageAllAdminScriptsWithGit#Summary_Of_Git_Commands|here]]). Keep each commit as small and precise as possible.&lt;br /&gt;
# If you want to undo the staging of chunks you just did : &#039;&#039;&#039;git reset -p&#039;&#039;&#039;&lt;br /&gt;
# Do the commit for the chunks you selected &#039;&#039;&#039;git commit -m&amp;quot;short message&amp;quot; [-m&amp;quot;long message&amp;quot;]&#039;&#039;&#039;. The long message is optional.&lt;br /&gt;
# To list the last commits &#039;&#039;&#039;git log -NumberOfCommits&#039;&#039;&#039;&lt;br /&gt;
# Last, you need to push the local changes to the remote server : &#039;&#039;&#039;git push origin BranchImWorkingOn&#039;&#039;&#039;&lt;br /&gt;
# Make a merge request on the [http://git.iihe.ac.be git site]: &amp;lt;br&amp;gt;[[File: git_merge_request.png|700px|center]]&lt;br /&gt;
&lt;br /&gt;
==== Summary Of Git Commands ====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
# First&lt;br /&gt;
git clone git.blahblah&lt;br /&gt;
git checkout master&lt;br /&gt;
git pull -p # updates the local branch from origin and prunes stale remote-tracking branches&lt;br /&gt;
git checkout -b newbranch&lt;br /&gt;
#or git branch -m oldbranch newbranch&lt;br /&gt;
&lt;br /&gt;
# if working on wrong branch&lt;br /&gt;
git stash&lt;br /&gt;
# then go to new branch&lt;br /&gt;
git stash apply&lt;br /&gt;
&lt;br /&gt;
git branch -ra ==&amp;gt; lists repo &amp;amp; local branches&lt;br /&gt;
&lt;br /&gt;
# Atomic commit (by hunks):&lt;br /&gt;
git add -N myfile #first prepare file before patchadding&lt;br /&gt;
git add -p&lt;br /&gt;
  y - stage this hunk&lt;br /&gt;
  n - do not stage this hunk&lt;br /&gt;
  a - stage this hunk and all later hunks in the file&lt;br /&gt;
  d - do not stage this hunk or any of the later hunks in the file&lt;br /&gt;
  g - select a hunk to go to&lt;br /&gt;
  / - search for a hunk matching the given regex&lt;br /&gt;
  j - leave this hunk undecided, see next undecided hunk&lt;br /&gt;
  J - leave this hunk undecided, see next hunk&lt;br /&gt;
  k - leave this hunk undecided, see previous undecided hunk&lt;br /&gt;
  K - leave this hunk undecided, see previous hunk&lt;br /&gt;
  s - split the current hunk into smaller hunks&lt;br /&gt;
  e - manually edit the current hunk&lt;br /&gt;
  ? - print help&lt;br /&gt;
&lt;br /&gt;
  ==&amp;gt; this adds hunks to the index, needs git commit -m&amp;quot;&amp;quot;&lt;br /&gt;
git reset -p   ==&amp;gt; revert the adding to index&lt;br /&gt;
&lt;br /&gt;
git commit -m&amp;quot;&amp;quot; [-m&amp;quot;long message&amp;quot;]&lt;br /&gt;
&lt;br /&gt;
git diff          ==&amp;gt; local / index&lt;br /&gt;
git diff HEAD     ==&amp;gt; local / repo&lt;br /&gt;
git diff --cached ==&amp;gt; index / repo&lt;br /&gt;
&lt;br /&gt;
git log -3  ==&amp;gt; lists the last 3 commits&lt;br /&gt;
&lt;br /&gt;
# Pushing&lt;br /&gt;
git push origin newbranch   ==&amp;gt; pushes local branch to repo &amp;amp; creates new repo branch&lt;br /&gt;
&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Adding /scripts to a machine and getting access to it ===&lt;br /&gt;
* You need to mount with nfs (or autofs) the /storage of tesla&lt;br /&gt;
* Then create a symlink from &#039;&#039;&#039;/storage_mnt/storage/group/admin/iihe-scripts&#039;&#039;&#039; to &#039;&#039;&#039;/scripts&#039;&#039;&#039;&lt;br /&gt;
* Or you can just include the template in &#039;&#039;&#039;config/nfs/storage&#039;&#039;&#039; in your machine, which does all this.&lt;br /&gt;
* It is read-only for non-root users of the &#039;&#039;&#039;admins&#039;&#039;&#039; group, so you cannot make any modifications there. &lt;br /&gt;
* If you want access as non-root user, you need to add yourself in [http://freeipa.wn.iihe.ac.be freeipa] [accessible through tunnel only] to the &#039;&#039;&#039;admins&#039;&#039;&#039; group.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
=== Old Wiki page for using SVN ===&lt;br /&gt;
*[[ManageAllAdminScriptsWithSVN| here]]&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=ManageAllAdminScriptsWithGit&amp;diff=621</id>
		<title>ManageAllAdminScriptsWithGit</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=ManageAllAdminScriptsWithGit&amp;diff=621"/>
		<updated>2016-02-03T12:48:09Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Adding/Modifying the scripts from your computer */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__TOC__&lt;br /&gt;
&lt;br /&gt;
* The admin scripts are managed by [http://git.iihe.ac.be Git]&lt;br /&gt;
* They should be mounted on all machines managed by quattor in &#039;&#039;&#039;/scripts&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
=== Git Workflow ===&lt;br /&gt;
==== Adding/Modifying the scripts from your computer ====&lt;br /&gt;
# Request an account from git admins&lt;br /&gt;
# add your ssh key to your profile&lt;br /&gt;
# Do a &#039;&#039;&#039;git clone git@git.iihe.ac.be:iihe-scripts.git&#039;&#039;&#039; to download the content of iihe-scripts locally &amp;lt;br&amp;gt; If you already have the script directory locally, you can first make sure you are on the master branch with &#039;&#039;&#039;git checkout master&#039;&#039;&#039; , and then pull any updates with &#039;&#039;&#039;git pull&#039;&#039;&#039; &amp;lt;br&amp;gt; If you have made some changes to the master branch (which you shouldn&#039;t!!), use &#039;&#039;&#039;git stash&#039;&#039;&#039; to &#039;&#039;store&#039;&#039; the changes and get a vanilla master, and when on the new branch (next step) use &#039;&#039;&#039;git stash apply&#039;&#039;&#039; to apply the changes to the current branch.&lt;br /&gt;
# Make a new branch from master &#039;&#039;&#039;git checkout -b BranchImWorkingOn&#039;&#039;&#039;. The branch name should reflect what you plan on doing. Never work directly on the &#039;&#039;master&#039;&#039; branch !! &amp;lt;br&amp;gt; You can check this worked with &#039;&#039;&#039;git branch -ra&#039;&#039;&#039; that lists local &amp;amp; remote branches.&lt;br /&gt;
# Make your new scripts or modifications&lt;br /&gt;
# If you made new files/directories, prepare them for atomic commit: &#039;&#039;&#039;git add -N myfile&#039;&#039;&#039;&lt;br /&gt;
# Select the chunks of code you want to commit: &#039;&#039;&#039;git add -p&#039;&#039;&#039; , (explanations of letters [[ManageAllAdminScriptsWithGit#Summary_Of_Git_Commands|here]]). Keep each commit as small and precise as possible.&lt;br /&gt;
# If you want to undo the staging of chunks you just did : &#039;&#039;&#039;git reset -p&#039;&#039;&#039;&lt;br /&gt;
# Do the commit for the chunks you selected &#039;&#039;&#039;git commit -m&amp;quot;short message&amp;quot; [-m&amp;quot;long message&amp;quot;]&#039;&#039;&#039;. The long message is optional.&lt;br /&gt;
# To list the last commits &#039;&#039;&#039;git log -NumberOfCommits&#039;&#039;&#039;&lt;br /&gt;
# Last, you need to push the local changes to the remote server : &#039;&#039;&#039;git push origin BranchImWorkingOn&#039;&#039;&#039;&lt;br /&gt;
# Make a merge request on the [http://git.iihe.ac.be git site]: &amp;lt;br&amp;gt;[[File: git_merge_request.png|700px|center]]&lt;br /&gt;
&lt;br /&gt;
==== Summary Of Git Commands ====&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
# First&lt;br /&gt;
git clone git.blahblah&lt;br /&gt;
git checkout -b newbranch   ==&amp;gt; making a new branch from master. Safe, will not delete files.&lt;br /&gt;
&lt;br /&gt;
git branch -ra ==&amp;gt; lists repo &amp;amp; local branches&lt;br /&gt;
git push origin --delete mybranch  ==&amp;gt; will delete remote branch &lt;br /&gt;
&lt;br /&gt;
# Atomic commit (by hunks):&lt;br /&gt;
git add -N myfile #first prepare file before patchadding&lt;br /&gt;
git add -p&lt;br /&gt;
  y - stage this hunk&lt;br /&gt;
  n - do not stage this hunk&lt;br /&gt;
  a - stage this hunk and all later hunks in the file&lt;br /&gt;
  d - do not stage this hunk or any of the later hunks in the file&lt;br /&gt;
  g - select a hunk to go to&lt;br /&gt;
  / - search for a hunk matching the given regex&lt;br /&gt;
  j - leave this hunk undecided, see next undecided hunk&lt;br /&gt;
  J - leave this hunk undecided, see next hunk&lt;br /&gt;
  k - leave this hunk undecided, see previous undecided hunk&lt;br /&gt;
  K - leave this hunk undecided, see previous hunk&lt;br /&gt;
  s - split the current hunk into smaller hunks&lt;br /&gt;
  e - manually edit the current hunk&lt;br /&gt;
  ? - print help&lt;br /&gt;
&lt;br /&gt;
  ==&amp;gt; this adds hunks to the index, needs git commit -m&amp;quot;&amp;quot;&lt;br /&gt;
git reset -p   ==&amp;gt; revert the adding to index&lt;br /&gt;
&lt;br /&gt;
git commit -m&amp;quot;&amp;quot; [-m&amp;quot;long message&amp;quot;]&lt;br /&gt;
&lt;br /&gt;
git diff          ==&amp;gt; local / index&lt;br /&gt;
git diff HEAD     ==&amp;gt; local / repo&lt;br /&gt;
git diff --cached ==&amp;gt; index / repo&lt;br /&gt;
&lt;br /&gt;
git log -3  ==&amp;gt; lists the last 3 commits&lt;br /&gt;
&lt;br /&gt;
# Pushing&lt;br /&gt;
git push origin newbranch   ==&amp;gt; pushes local branch to repo &amp;amp; creates new repo branch&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Adding /scripts to a machine and getting access to it ===&lt;br /&gt;
* You need to mount with nfs (or autofs) the /storage of tesla&lt;br /&gt;
* Then create a symlink from &#039;&#039;&#039;/storage_mnt/storage/group/admin/iihe-scripts&#039;&#039;&#039; to &#039;&#039;&#039;/scripts&#039;&#039;&#039;&lt;br /&gt;
* Or you can just include the template in &#039;&#039;&#039;config/nfs/storage&#039;&#039;&#039; in your machine, which does all this.&lt;br /&gt;
* It is read-only for non-root users of the &#039;&#039;&#039;admins&#039;&#039;&#039; group, so you cannot make any modifications there. &lt;br /&gt;
* If you want access as non-root user, you need to add yourself in [http://freeipa.wn.iihe.ac.be freeipa] [accessible through tunnel only] to the &#039;&#039;&#039;admins&#039;&#039;&#039; group.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
=== Old Wiki page for using SVN ===&lt;br /&gt;
*[[ManageAllAdminScriptsWithSVN| here]]&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=614</id>
		<title>Getting a certificate for the T2</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=614"/>
		<updated>2016-01-22T16:24:14Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;If you need grid access on the T2, please follow all the steps below: &amp;lt;br&amp;gt; &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== First time certificate ===&lt;br /&gt;
# [[Obtaining_a_certificate | Get a certificate (new)]]. &#039;&#039;If you already have a CERN grid certificate, you can temporarily use it. But for accounting reasons, we need you to get a Belgian certificate &#039;&#039;&lt;br /&gt;
# Extract your certificate from your browser: &#039;&#039;&#039;[[Chrome]]&#039;&#039;&#039; | &#039;&#039;&#039;[[Firefox]]&#039;&#039;&#039;&lt;br /&gt;
# [[certificate_to_UI | Put your certificate on the UIs]]&lt;br /&gt;
# [[Register_to_the_CMS_VO|Register to the CMS VO]]&lt;br /&gt;
# [[SiteDB | Check if your certificate is ok on SiteDB]]. Note the DN.&lt;br /&gt;
# Send a mail to the T2B support (grid_adminATlistserv.vub.ac.be) with your DN in order to have write access on the T2.&lt;br /&gt;
# [[CERN_certificate_management | Check that your certificate is the only one registered on the CERN website.]]&lt;br /&gt;
# [[Check_Certificate_UIs | Check if everything works fine on the mX machines]]&lt;br /&gt;
&lt;br /&gt;
=== Certificate renewal ===&lt;br /&gt;
Because we switched to a different certificate provider, you need to follow the &#039;First time certificate&#039; steps to get a new one, until at least December 2016.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=613</id>
		<title>Getting a certificate for the T2</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=613"/>
		<updated>2016-01-22T15:05:17Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;If you need grid access on the T2, please follow all the steps below: &amp;lt;br&amp;gt; &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== First time certificate ===&lt;br /&gt;
# [[Obtaining_a_certificate | Get a certificate (new)]]. &#039;&#039;If you already have a CERN grid certificate, you can temporarily use it. But for accounting reasons, we need you to get a Belgian certificate &#039;&#039;&lt;br /&gt;
# Extract your certificate from your browser: &#039;&#039;&#039;[[Chrome]]&#039;&#039;&#039; | &#039;&#039;&#039;[[Firefox]]&#039;&#039;&#039;&lt;br /&gt;
# [[certificate_to_UI | Put your certificate on the UIs]]&lt;br /&gt;
# [[Register_to_the_CMS_VO|Register to the CMS VO]]&lt;br /&gt;
# [[SiteDB | Check if your certificate is ok on SiteDB]]. Note the DN.&lt;br /&gt;
# Send a mail to the T2B support (grid_adminATlistserv.vub.iihe.ac.be) with your DN in order to have write access on the T2.&lt;br /&gt;
# [[CERN_certificate_management | Check that your certificate is the only one registered on the CERN website.]]&lt;br /&gt;
# [[Check_Certificate_UIs | Check if everything works fine on the mX machines]]&lt;br /&gt;
&lt;br /&gt;
=== Certificate renewal ===&lt;br /&gt;
Because we switched to a different certificate provider, you need to follow the &#039;First time certificate&#039; steps to get a new one, until at least December 2016.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=612</id>
		<title>Getting a certificate for the T2</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Getting_a_certificate_for_the_T2&amp;diff=612"/>
		<updated>2016-01-22T15:04:38Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;If you need grid access on the T2, please follow all the steps below: &amp;lt;br&amp;gt; &amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== First time certificate ===&lt;br /&gt;
# [[Obtaining_a_certificate | Get a certificate (new)]]. &#039;&#039;If you already have a CERN grid certificate, you can temporarily use it. But for accounting reasons, we need you to get a Belgian certificate &#039;&#039;&lt;br /&gt;
# Extract your certificate from your browser: &#039;&#039;&#039;[[Chrome]]&#039;&#039;&#039; | &#039;&#039;&#039;[[Firefox]]&#039;&#039;&#039;&lt;br /&gt;
# [[certificate_to_UI | Put your certificate on the UIs]]&lt;br /&gt;
# [[Register_to_the_CMS_VO|Register to the CMS VO]]&lt;br /&gt;
# [[SiteDB | Check if your certificate is ok on SiteDB]]. Note the DN.&lt;br /&gt;
# Send a mail to grid_adminATlistserv.vub.iihe.ac.be with your DN in order to have write access on the T2.&lt;br /&gt;
# [[CERN_certificate_management | Check that your certificate is the only one registered on the CERN website.]]&lt;br /&gt;
# [[Check_Certificate_UIs | Check if everything works fine on the mX machines]]&lt;br /&gt;
&lt;br /&gt;
=== Certificate renewal ===&lt;br /&gt;
Because we switched to a different certificate provider, you need to follow the &#039;First time certificate&#039; steps to get a new one, until at least December 2016.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=611</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=611"/>
		<updated>2016-01-21T15:46:00Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into one of these machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
:: The &#039;&#039;&#039;File Server&#039;&#039;&#039; provides the user home on the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity with regular backups.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Mass Storage&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should look first for information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these indicators is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under heavy load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but for sharing files or producing results within a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
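&lt;br /&gt;
As an illustration of the /scratch advice above, here is a minimal shell sketch of a job body that works in scratch space and copies its result to permanent storage before the scratch area is cleaned. The function name, the result file name and the fallback to /tmp when $TMPDIR is unset are assumptions made for this example, not part of the cluster setup.&lt;br /&gt;
&lt;br /&gt;
```shell
#!/bin/sh
# Sketch only: stage_and_run is a hypothetical helper name.
# Usage: stage_and_run OUTPUT_DIR   (e.g. a directory under /user/$USER)
stage_and_run() {
    outdir="$1"
    # $TMPDIR points to scratch space on the worker nodes;
    # falling back to /tmp elsewhere is an assumption for local testing.
    workdir=$(mktemp -d "${TMPDIR:-/tmp}/job.XXXXXX") || return 1
    # Placeholder for the real payload: write a small result file in scratch.
    echo "done" > "$workdir/result.txt"
    # Copy the result to permanent storage, then remove the scratch copy.
    mkdir -p "$outdir" || return 1
    cp "$workdir/result.txt" "$outdir/" || return 1
    rm -rf "$workdir"
}
```
&lt;br /&gt;
In a real job, the placeholder line would be replaced by the actual workload, and the output directory would typically sit under /user or /pnfs, depending on the size of the results.&lt;br /&gt;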
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into the following queues :&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt; &amp;lt;span style=&#039;color:red&#039;&amp;gt;01/2016 : Not enabled yet&amp;lt;/span&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2344&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 768&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |20 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue (default: any)&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resources options&lt;br /&gt;
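&lt;br /&gt;
The options above can be combined on one command line. The sketch below only prints the resulting command rather than submitting anything. The helper name is hypothetical, and the walltime resource syntax assumes a Torque/PBS-style batch system; check the qsub manual page on the UIs before relying on it.&lt;br /&gt;
&lt;br /&gt;
```shell
#!/bin/sh
# Dry-run helper (hypothetical name): prints the qsub command instead of running it.
# Arguments: queue name, job name, walltime (HH:MM:SS), job script.
build_qsub_cmd() {
    queue="$1"
    name="$2"
    walltime="$3"
    script="$4"
    # -q and -N are the options listed above; "-l walltime=..." assumes
    # Torque/PBS-style resource syntax.
    printf 'qsub -q %s -N %s -l walltime=%s %s\n' "$queue" "$name" "$walltime" "$script"
}

# Example: a short test job for the express queue (20-minute walltime limit).
build_qsub_cmd express mytest 00:15:00 myjob.sh
```
&lt;br /&gt;
Requesting a walltime below the queue limit, as in this example, keeps the job within the limits given in the queue table.&lt;br /&gt;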
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how jobs are distributed across the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
Users cannot access the backups directly. Send a mail to grid_admin@listserv.vub.iihe.ac.be specifying the file(s)/folder(s) you want and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our zfs server allows us to do regular snapshots:&lt;br /&gt;
* One snapshot is taken every day at midnight; we keep the last seven of those (so a complete week).&lt;br /&gt;
* We also have one snapshot per week, and keep the last 4 of them (so a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main fileserver goes down.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=610</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=610"/>
		<updated>2016-01-21T15:07:26Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Job management */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into one of these machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system : it manages all the submitted jobs, and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Storage Machines&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
:* The &#039;&#039;&#039;User Storage&#039;&#039;&#039;: it provides the home of the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option is used to keep your session alive for a long period of time ! You should not be disconnected during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, where you should look first for information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, this way everyone on the support team can answer and track the progress.)&lt;br /&gt;
* The cpu time limit imposed per process, as we divided our UIs into 2 groups.&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of these indicators is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under heavy load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do !&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user , but for sharing files or producing results within a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into the following queues :&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt; &amp;lt;span style=&#039;color:red&#039;&amp;gt;01/2016 : Not enabled yet&amp;lt;/span&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2344&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 768&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |20 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue (default: any)&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resources options&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how jobs are distributed across the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
Users cannot access the backups directly. Send a mail to grid_admin@listserv.vub.iihe.ac.be specifying the file(s)/folder(s) you want and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our zfs server allows us to do regular snapshots:&lt;br /&gt;
* One snapshot is taken every day at midnight; we keep the last seven of those (so a complete week).&lt;br /&gt;
* We also have one snapshot per week, and keep the last 4 of them (so a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main fileserver goes down.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=609</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=609"/>
		<updated>2016-01-21T15:06:02Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Usefull links */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines :&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into one of these machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system: it manages all the submitted jobs and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Storage Machines&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
:* The &#039;&#039;&#039;User Storage&#039;&#039;&#039;: it provides the home of the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option makes ssh send a keep-alive message to the server whenever the connection has been idle for 100 seconds, so your session should not be dropped during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, which is the first place to look for information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, so that everyone on the support team can answer and track progress)&lt;br /&gt;
* The CPU time limit imposed per process, which depends on which of our 2 groups of UIs you are logged into:&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of the values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do!&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user, but for sharing or producing files within a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
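&lt;br /&gt;
As an illustration of the scratch space described above, here is a minimal sketch of a job script body that works in $TMPDIR. The file name scratch_output.txt and the fallback to /tmp are assumptions made for this example, not cluster conventions.&lt;br /&gt;

```shell
#!/bin/sh
# Sketch of a job script body; the file name is a hypothetical placeholder.
# On a worker node TMPDIR is expected to be set; fall back to /tmp elsewhere (assumption).
workdir="${TMPDIR:-/tmp}"
cd "$workdir" || exit 1

# Write intermediate output on the local scratch space.
printf 'intermediate result\n' | tee scratch_output.txt

# Copy what you want to keep to your home directory before the job ends,
# because the scratch area is cleaned after each job, for example:
# cp scratch_output.txt /user/$USER/
```

The copy step is left commented out because the destination depends on where you keep your results.&lt;br /&gt;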
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into the following queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt; &amp;lt;span style=&#039;color:red&#039;&amp;gt;01/2016 : Not enabled yet&amp;lt;/span&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2344&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 768&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |20 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue (default: any)&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
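&lt;br /&gt;
As an illustration of the options above, here is a minimal sketch of how a submission command could be assembled. The helper name build_qsub_cmd, the job name and the mail address are hypothetical. The sketch assumes a PBS/Torque-style qsub, where -M sets the mail recipient and -m selects the mail events; the choice of ae is only an example. The function prints the command instead of running it.&lt;br /&gt;

```shell
#!/bin/sh
# Sketch only: build_qsub_cmd is a hypothetical helper, not a cluster tool.
# It prints the qsub command line instead of running it, so it is safe to try anywhere.
build_qsub_cmd() {
    queue="$1"    # queue name, e.g. localgrid or express
    name="$2"     # job name passed to -N
    mail="$3"     # address passed to -M (must be @ulb.ac.be or @vub.ac.be)
    script="$4"   # job script to submit
    printf 'qsub -q %s -N %s -M %s -m ae %s\n' "$queue" "$name" "$mail" "$script"
}

# Example: short test job on the express queue, mail on abort and on end.
build_qsub_cmd express mytest someone@vub.ac.be myjob.sh
```

To actually submit, run the printed command on one of the UIs.&lt;br /&gt;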
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how jobs are distributed across the queues&lt;br /&gt;
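&lt;br /&gt;
The options above can be combined. The following is a minimal sketch, where show_my_jobs is a hypothetical wrapper name and the check for qstat is only there so that the example also runs on a machine without the batch tools.&lt;br /&gt;

```shell
#!/bin/sh
# Sketch: show_my_jobs is a hypothetical wrapper around the qstat options listed above.
show_my_jobs() {
    if command -v qstat; then   # prints the path of qstat when it is installed
        qstat -n -u "$USER"     # your jobs only, with the nodes they run on
        qstat -q                # how jobs are spread over the queues
    else
        printf 'qstat not found: run this on a UI\n'
    fi
}

show_my_jobs
```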
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]]&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
Users cannot access the backups directly. To restore something, send a mail to grid_admin@listserv.vub.iihe.ac.be specifying the file(s)/folder(s) you want and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our ZFS server allows us to take regular snapshots:&lt;br /&gt;
* One snapshot is taken every day at midnight; we keep the last seven (a complete week).&lt;br /&gt;
* We also take one snapshot per week and keep the last 4 (a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main file server goes down.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=608</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=608"/>
		<updated>2016-01-21T15:05:47Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Usefull links */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines:&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into one of these machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system: it manages all the submitted jobs and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Storage Machines&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
:* The &#039;&#039;&#039;User Storage&#039;&#039;&#039;: it provides the home of the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option makes ssh send a keep-alive message to the server whenever the connection has been idle for 100 seconds, so your session should not be dropped during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, which is the first place to look for information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, so that everyone on the support team can answer and track progress)&lt;br /&gt;
* The CPU time limit imposed per process, which depends on which of our 2 groups of UIs you are logged into:&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of the values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. You have an enforced quota there, as it is an expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store a large amount of data, and it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do!&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take notice of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user, but for sharing or producing files within a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : a temporary scratch space for your job. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into the following queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt; &amp;lt;span style=&#039;color:red&#039;&amp;gt;01/2016 : Not enabled yet&amp;lt;/span&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2344&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 768&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |20 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue (default: any)&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run the job in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted , b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show how jobs are distributed across the queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]]&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
Users cannot access the backups directly. To restore something, send a mail to grid_admin@listserv.vub.iihe.ac.be specifying the file(s)/folder(s) you want and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our ZFS server allows us to take regular snapshots:&lt;br /&gt;
* One snapshot is taken every day at midnight; we keep the last seven (a complete week).&lt;br /&gt;
* We also take one snapshot per week and keep the last 4 (a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main file server goes down.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;br /&gt;
[http://gangliat2b.iihe.ac.be/ganglia/  Ganglia Monitoring] : status of all our servers.&amp;lt;br&amp;gt;&lt;br /&gt;
[http://mon.iihe.ac.be/jobview/overview.html  JobView Monitoring ] : summary of the cluster usage.&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=607</id>
		<title>Cluster Overview</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=Cluster_Overview&amp;diff=607"/>
		<updated>2016-01-21T14:54:33Z</updated>

		<summary type="html">&lt;p&gt;Romain Rougny: /* Backup */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;== Overview ==&lt;br /&gt;
&lt;br /&gt;
The cluster is composed of 3 groups of machines:&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;User Interfaces (UI)&#039;&#039;&#039;&lt;br /&gt;
::This is the cluster front-end: to use the cluster, you need to log into one of these machines.&lt;br /&gt;
::::Servers : [ m0 , m1 , m2 , m3 ] , [ m5 , m6 , m7 , m8 , m9 ]&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Computing Machines :&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Computing Element (CE):&#039;&#039;&#039; This server is the brain of the batch system: it manages all the submitted jobs and sends them to the worker nodes.&lt;br /&gt;
::::Servers : cream02&lt;br /&gt;
&lt;br /&gt;
:* The &#039;&#039;&#039;Worker Nodes (WN): &#039;&#039;&#039; This is the power of the cluster : they run multiple jobs in parallel and send the results &amp;amp; status back to the CE.&lt;br /&gt;
::::Servers : nodeXX-YY&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
* The &#039;&#039;&#039;Storage Machines&#039;&#039;&#039;&lt;br /&gt;
** The &#039;&#039;&#039;Storage Element&#039;&#039;&#039;: it is the brain of the cluster storage. Grid accessible, it knows where all the files are, and manages all the storage nodes.&lt;br /&gt;
::::Server : maite&lt;br /&gt;
:* The &#039;&#039;&#039;Storage Nodes&#039;&#039;&#039;: This is the memory of the cluster : they contain big data files. In total, they provide ~2300 TB of grid-accessible storage.&lt;br /&gt;
::::Servers : beharXXX&lt;br /&gt;
:* The &#039;&#039;&#039;User Storage&#039;&#039;&#039;: it provides the home of the UIs. It is a highly efficient &amp;amp; redundant storage node of ~70 TB capacity.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== How to Connect ==&lt;br /&gt;
&lt;br /&gt;
To connect to the cluster, you need to have sent us your public ssh key.&lt;br /&gt;
In a terminal, type the following:&lt;br /&gt;
 ssh -X -o ServerAliveInterval=100 username@m0.iihe.ac.be&lt;br /&gt;
:&#039;&#039;Tip: the &#039;&#039;-o ServerAliveInterval=100&#039;&#039; option makes ssh send a keep-alive message to the server whenever the connection has been idle for 100 seconds, so your session should not be dropped during a whole day of work.&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
After a successful login, you&#039;ll see this message :&lt;br /&gt;
 &amp;lt;span style=&#039;color:green&#039;&amp;gt;&lt;br /&gt;
          @@@@@@@@     @@@@             @@@@@     @@@@@@@&lt;br /&gt;
             @@       @    @            @@   @    @@&lt;br /&gt;
             @@            @    @@@@    @@@@@     @@@@&lt;br /&gt;
             @@         @@              @@    @   @@@@&lt;br /&gt;
             @@       @                 @@    @   @@&lt;br /&gt;
             @@       @@@@@@            @@@@@@    @@@@@@@&lt;br /&gt;
                               @ IIHE   &lt;br /&gt;
 &amp;lt;/span&amp;gt;&lt;br /&gt;
   Welcome to the t2b cluster ! You are on the following UI: m2 &amp;lt;br&amp;gt;&lt;br /&gt;
   You can find more info on our wiki page: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;http://t2bwiki.iihe.ac.be&amp;lt;/span&amp;gt;&lt;br /&gt;
            To contact us: &amp;lt;span style=&#039;color:blue&#039;&amp;gt;grid_admin@listserv.vub.ac.be&amp;lt;/span&amp;gt;&amp;lt;br&amp;gt;&lt;br /&gt;
   Please remember this machine will allow you only &amp;lt;span style=&#039;color:red&#039;&amp;gt;600s (10 minutes)&amp;lt;/span&amp;gt;&lt;br /&gt;
      of cpu time per processes.&lt;br /&gt;
  ________________________________________________________________________&lt;br /&gt;
                   Your Quota on /user: &amp;lt;span style=&#039;color:green&#039;&amp;gt;43% used&amp;lt;/span&amp;gt; (282G left) &lt;br /&gt;
 There are &amp;lt;span style=&#039;color:green&#039;&amp;gt;2 users&amp;lt;/span&amp;gt; here   |   Load: &amp;lt;span style=&#039;color:red&#039;&amp;gt;7.51 /8 CPUs (2%)&amp;lt;/span&amp;gt;  |   Mem: &amp;lt;span style=&#039;color:orange&#039;&amp;gt;80% used&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Please observe all the information in this message:&lt;br /&gt;
* The wiki link, which is the first place to look for information&lt;br /&gt;
* The email used for the cluster support (please use this one rather than personal mail, so that everyone on the support team can answer and track progress)&lt;br /&gt;
* The CPU time limit imposed per process, which depends on which of our 2 groups of UIs you are logged into:&lt;br /&gt;
:: &#039;&#039;&#039;The light task&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 10 minutes)&amp;lt;/span&amp;gt; : they are used for crab/local job submission, writing code, building, debugging ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m0.iihe.ac.be, m1.iihe.ac.be, m2.iihe.ac.be, m3.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
:: &#039;&#039;&#039;The CPU-intensive&#039;&#039;&#039; UIs &amp;lt;span style=&#039;color:red&#039;&amp;gt;(max &#039;&#039;&#039;CPU&#039;&#039;&#039; time = 5 hours)&amp;lt;/span&amp;gt; : they are available for CPU-intensive and long tasks, although you should prefer using local job submission ...&lt;br /&gt;
::&amp;lt;pre&amp;gt;m5.iihe.ac.be, m6.iihe.ac.be, m7.iihe.ac.be, m8.iihe.ac.be and m9.iihe.ac.be&amp;lt;/pre&amp;gt;&lt;br /&gt;
* The quota you have left on /user&lt;br /&gt;
* Information about how heavily this UI is used. If any of the values is red (i.e. above optimal usage), please consider using another UI. Please be mindful of other users and don&#039;t start too many processes, especially if the UI is already under load.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Data Storage &amp;amp; Directory Structure ==&lt;br /&gt;
&lt;br /&gt;
There are 2 main directories to store your work and data:&lt;br /&gt;
* &#039;&#039;&#039;/user [/$USER]&#039;&#039;&#039; : this is your home directory. A quota is enforced there, as it is expensive storage with redundancy and daily backups.&lt;br /&gt;
* &#039;&#039;&#039;/pnfs [/iihe/cms/store/user/$USER]&#039;&#039;&#039; : this is where you can store large amounts of data; it is also [[GridStorageAccess|grid-accessible]]. If you need more than 2 TB, please contact us. There are no backups there, so be careful what you do!&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
There are other directories that you might want to take note of:&lt;br /&gt;
* &#039;&#039;&#039;/group&#039;&#039;&#039; : same as /user, but for files you need to share or produce as a group.&lt;br /&gt;
* &#039;&#039;&#039;/scratch&#039;&#039;&#039; : temporary scratch space for your jobs. Use $TMPDIR on the WNs; it is cleaned after each job :)&lt;br /&gt;
* &#039;&#039;&#039;/cvmfs&#039;&#039;&#039; : Centralised CVMFS software repository from CERN. It should contain most of the software you will need.&lt;br /&gt;
* &#039;&#039;&#039;/swmgrs&#039;&#039;&#039; : local area for shared software not in /cvmfs . You can use a [[OtherSoftware|nice tool]] to find the software and versions available.&lt;br /&gt;
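&lt;br /&gt;
As an illustration of the scratch space described above, here is a minimal sketch of a job body that works in $TMPDIR and copies its result out before the job ends. The function name run_in_scratch, the file name result.txt and the destination directory are hypothetical, and the fallback to mktemp is only an assumption so that the sketch also runs outside a WN.&lt;br /&gt;

```shell
#!/bin/bash
# Sketch of a job body that works in scratch space and copies its result out at the end.
# run_in_scratch, result.txt and the destination directory are hypothetical names.
run_in_scratch() {
    local result_dir="$1"
    # On the WNs, use $TMPDIR as scratch; elsewhere fall back to a fresh temp directory.
    local workdir="${TMPDIR:-$(mktemp -d)}"
    mkdir -p "$result_dir"
    # Stand-in for the real computation: write one output file into scratch.
    echo "job output" | tee "$workdir/result.txt"
    # Copy the result out, since scratch is cleaned after each job.
    cp "$workdir/result.txt" "$result_dir/"
}
```

It could be called at the end of a job script as run_in_scratch /user/$USER/results, keeping only the final files under the quota-limited home directory.&lt;br /&gt;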
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
== Batch System ==&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
=== Queues ===&lt;br /&gt;
&lt;br /&gt;
The cluster is divided into the following queues:&lt;br /&gt;
&lt;br /&gt;
{| width=&amp;quot;1064&amp;quot; cellspacing=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; border=&amp;quot;1&amp;quot; align=&amp;quot;center&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | &amp;lt;br&amp;gt;&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | localgrid&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | highmem&lt;br /&gt;
! scope=&amp;quot;col&amp;quot; | express&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Description&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | default queue, all available nodes except express&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Subgroup of localgrid with WNs having 4GB Mem / Slot&amp;lt;br&amp;gt; &amp;lt;span style=&#039;color:red&#039;&amp;gt;01/2016 : Not enabled yet&amp;lt;/span&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | Limited walltime&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | # CPUs (Jobs)&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2344&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 768&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 16&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Walltime limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot; | 95 hours &lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; |20 minutes &amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Memory limit&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 4 GB&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 2 GB&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs running / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 400 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 3 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
! scope=&amp;quot;row&amp;quot; | Max # jobs sent to the batch system / User&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; colspan=&amp;quot;2&amp;quot;| 2500 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
| nowrap=&amp;quot;nowrap&amp;quot; align=&amp;quot;center&amp;quot; | 100 jobs&amp;lt;br&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
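&lt;br /&gt;
The walltime limits in the table can be turned into a small decision helper. The sketch below is only an illustration: the function name pick_queue is hypothetical, it considers walltime alone (20 minutes for express, 95 hours for localgrid) and ignores the memory and per-user job limits.&lt;br /&gt;

```shell
#!/bin/bash
# Hypothetical helper: suggest a queue from an estimated walltime given in minutes.
# Limits are taken from the table: express = 20 minutes, localgrid = 95 hours.
pick_queue() {
    local minutes="$1"
    if [ "$minutes" -le 20 ]; then
        echo express
    elif [ "$minutes" -le $((95 * 60)) ]; then
        echo localgrid
    else
        # Longer than any walltime limit: the job has to be split up.
        echo none
    fi
}

pick_queue 15     # prints: express
pick_queue 600    # prints: localgrid
```

Keep in mind that express only allows 3 running jobs per user, so it suits quick tests rather than large productions.&lt;br /&gt;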
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job submission ===&lt;br /&gt;
&lt;br /&gt;
To submit a job, you just have to use the &#039;&#039;&#039;qsub&#039;&#039;&#039; command&amp;amp;nbsp;:&lt;br /&gt;
&amp;lt;pre&amp;gt;qsub myjob.sh&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
*-q queueName&amp;amp;nbsp;: choose the queue (default: any)&lt;br /&gt;
&lt;br /&gt;
*-N jobName&amp;amp;nbsp;: name of the job&lt;br /&gt;
&lt;br /&gt;
*-I&amp;amp;nbsp;: (capital i) run in interactive mode&lt;br /&gt;
&lt;br /&gt;
*-M mailaddress&amp;amp;nbsp;: set mail address (use in conjunction with -m) : MUST be @ulb.ac.be or @vub.ac.be&lt;br /&gt;
&lt;br /&gt;
*-m [a|b|e]&amp;amp;nbsp;: send mail on job status change (a = aborted, b = begin, e = end)&lt;br /&gt;
&lt;br /&gt;
*-l&amp;amp;nbsp;: resource options&lt;br /&gt;
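&lt;br /&gt;
To show how these options combine, the sketch below assembles a qsub command line and prints it instead of running it, so it can be checked before submitting. The function name build_qsub_cmd and the example values are hypothetical, and the walltime=HH:MM:SS form of the -l option is an assumption based on Torque/PBS-style syntax; check it against the local qsub manual.&lt;br /&gt;

```shell
#!/bin/bash
# Build (but do not run) a qsub command line from the options listed above.
# build_qsub_cmd is a hypothetical helper; queue, name and walltime are placeholders.
# Assumption: the -l resource option accepts walltime=HH:MM:SS (Torque/PBS-style syntax).
build_qsub_cmd() {
    local script="$1" queue="$2" name="$3" walltime="$4"
    echo "qsub -q $queue -N $name -l walltime=$walltime $script"
}

# Dry run: print the command line for inspection.
build_qsub_cmd myjob.sh localgrid mytest 01:00:00
# prints: qsub -q localgrid -N mytest -l walltime=01:00:00 myjob.sh
```

Printing the command first is a deliberate choice: it makes it easy to spot a wrong queue or job name before the job reaches the batch system.&lt;br /&gt;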
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Job management ===&lt;br /&gt;
&lt;br /&gt;
To see all jobs (running / queued), you can use the &#039;&#039;&#039;qstat&#039;&#039;&#039; command, or go to the [http://mon.iihe.ac.be/jobview/overview.html JobView] page to have a summary of what&#039;s running.&lt;br /&gt;
&amp;lt;pre&amp;gt;qstat&lt;br /&gt;
&amp;lt;/pre&amp;gt; &lt;br /&gt;
&#039;&#039;OPTIONS&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
* -u username&amp;amp;nbsp;: list only jobs submitted by username&lt;br /&gt;
&lt;br /&gt;
* -n&amp;amp;nbsp;: show nodes where jobs are running&lt;br /&gt;
&lt;br /&gt;
* -q&amp;amp;nbsp;: show the distribution of jobs across queues&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&#039;&#039;&#039;If you want to test the batch system, you can follow the workbook [[LocalSubmission|here]].&#039;&#039;&#039;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Backup ==&lt;br /&gt;
&#039;&#039;&#039;Backup of /user &amp;amp; /group&#039;&#039;&#039;&amp;lt;br&amp;gt;&lt;br /&gt;
Users cannot access the backups directly. To restore something, send a mail to grid_admin@listserv.vub.iihe.ac.be specifying the file(s)/folder(s) you want and the date you want them from.&lt;br /&gt;
&lt;br /&gt;
Our ZFS server allows us to take regular snapshots:&lt;br /&gt;
* One is taken every day at midnight; we keep the last seven (a complete week).&lt;br /&gt;
* One is also taken per week; we keep the last 4 (a month).&lt;br /&gt;
* Additionally, /user is backed up every day to a secondary machine, in case our main file server goes down.&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Useful links ==&lt;/div&gt;</summary>
		<author><name>Romain Rougny</name></author>
	</entry>
</feed>