<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en-GB">
	<id>https://t2bwiki.iihe.ac.be/index.php?action=history&amp;feed=atom&amp;title=PutClusterOn</id>
	<title>PutClusterOn - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://t2bwiki.iihe.ac.be/index.php?action=history&amp;feed=atom&amp;title=PutClusterOn"/>
	<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=PutClusterOn&amp;action=history"/>
	<updated>2026-04-20T09:47:39Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.43.5</generator>
	<entry>
		<id>https://t2bwiki.iihe.ac.be/index.php?title=PutClusterOn&amp;diff=226&amp;oldid=prev</id>
		<title>Maintenance script: Created page with &quot; == How to put the cluster back on after a shutdown== PageOutline  === Start all storage === Machines to put on:&lt;br&gt; All behars&lt;br&gt;    careful: for all HP machines, ...&quot;</title>
		<link rel="alternate" type="text/html" href="https://t2bwiki.iihe.ac.be/index.php?title=PutClusterOn&amp;diff=226&amp;oldid=prev"/>
		<updated>2015-08-26T12:28:59Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot; == How to put the cluster back on after a shutdown== &lt;a href=&quot;/index.php?title=PageOutline&amp;amp;action=edit&amp;amp;redlink=1&quot; class=&quot;new&quot; title=&quot;PageOutline (page does not exist)&quot;&gt;PageOutline&lt;/a&gt;  === Start all storage === Machines to put on:&amp;lt;br&amp;gt; All behars&amp;lt;br&amp;gt;    careful: for all HP machines, ...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&lt;br /&gt;
== How to put the cluster back on after a shutdown==&lt;br /&gt;
[[PageOutline]]&lt;br /&gt;
&lt;br /&gt;
=== Start all storage ===&lt;br /&gt;
Machines to put on:&amp;lt;br&amp;gt;&lt;br /&gt;
All behars&amp;lt;br&amp;gt;&lt;br /&gt;
   careful: for all HP machines, first put on the JBOD, wait for 2 minutes and only then put on the servers.&amp;lt;br&amp;gt;&lt;br /&gt;
   NOTE: at the last startup, after having put on the JBODs, I found the servers on also, but the raids were mounted, so no problem.&lt;br /&gt;
freenas (aka behar062)&amp;lt;br&amp;gt;&lt;br /&gt;
nexenta&amp;lt;br&amp;gt;&lt;br /&gt;
jefke.wn&amp;lt;br&amp;gt;&lt;br /&gt;
fs.wn&amp;lt;br&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Start some servers ===&lt;br /&gt;
First, wait for all storage to be back&amp;lt;br&amp;gt;&lt;br /&gt;
Then start putting some servers back online&amp;lt;br&amp;gt;&lt;br /&gt;
*the dom&amp;#039;s&lt;br /&gt;
*ccq&lt;br /&gt;
*the M machines&lt;br /&gt;
&lt;br /&gt;
=== Doms and VMs  ===&lt;br /&gt;
As some servers were put off that are virtual, you need to log in to the corresponding dom to restart it in virt-manager.&amp;lt;br&amp;gt;&lt;br /&gt;
From one dom machine, you should normally be able to connect to all the other dom&amp;#039;s through virt-manager.&amp;lt;br&amp;gt;&lt;br /&gt;
Concerning the order in which the VMs are to be restarted, you should first start qproxy and quattorrepository. The CEs (cream02) should be the last machine to restart.&amp;lt;br&amp;gt;&lt;br /&gt;
Notice that there could also be other machines not being switched on. Have a look around.&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;lt;br&amp;gt;&lt;br /&gt;
On the machines that were put on automatically, the time will be wrong. This can easily be solved by restarting the ntpd server on all of them:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./distrib_exec_list list_virtual_machines_ntpd &amp;quot;service ntpd restart&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Workernodes ===&lt;br /&gt;
You can now slowly start putting on all the WN&amp;#039;s.&amp;lt;br&amp;gt;&lt;br /&gt;
Do not switch them on at once. Put a few at the time.&lt;br /&gt;
&lt;br /&gt;
=== Start dCache ===&lt;br /&gt;
Got to maite:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
/etc/init.d/pnfs start&lt;br /&gt;
/opt/d-cache/bin/dcache start&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
If there are some behars that need to be put read-only in dcache, do so now.&lt;br /&gt;
on ccq:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./distrib_exec_list list_behars_all &amp;quot;/opt/d-cache/bin/dcache start&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Start phedex ===&lt;br /&gt;
go to frontier&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
su - phedex&lt;br /&gt;
./masterProd start&lt;br /&gt;
./masterDebug start&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== Reopen the queues ===&lt;br /&gt;
go to cream02&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./enable_queues&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== pnfs on the Wn&amp;#039;s ===&lt;br /&gt;
Sometimes, pnfs is not mounted correctly on all the WN&amp;#039;s. Best to issue a mount -a on all the WN&amp;#039;s:&amp;lt;br&amp;gt;&lt;br /&gt;
on ccq:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
./distrib_exec_list node_list_all_wns &amp;quot;mount -a&amp;quot;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{{TracNotice|{{PAGENAME}}}}&lt;/div&gt;</summary>
		<author><name>Maintenance script</name></author>
	</entry>
</feed>