AdminPage: Difference between revisions

From T2B Wiki
Jump to navigation Jump to search
No edit summary
 
(62 intermediate revisions by 8 users not shown)
Line 5: Line 5:
*[[PutClusterOn| How to properly put the cluster on]]
*[[PutClusterOn| How to properly put the cluster on]]
==== CMS Services ====
==== CMS Services ====
*[[OpenID | How to use tokens and openID in the grid]]
*[[Phedex]]
*[[Phedex]]
*[[Heartbeat]]
*[[Heartbeat]]
Line 10: Line 11:
*[[FroNTier]]
*[[FroNTier]]
*[[ProdAgent]]
*[[ProdAgent]]
*[[GitForSiteConf| instructions to commit siteconf to git]]
*[[GitForSiteConf| Instructions to commit siteconf to git]]
 
==== Grid Configuration Issues ====
==== Grid Configuration Issues ====
*[[UpdateCertificates| Update the certificates of all our machines]]
*[[UpdateCertificates| Update the certificates of all our machines]]
*[[CreamIssues| Issues with cream and how to solve them]]
*[[CreamIssues| Issues with cream and how to solve them]]
*[[PBS_TMPDIR| PBS TMPDIR]]
*[[PBS_TMPDIR| PBS TMPDIR]]
*[[APEL]]
*[[APEL| <strike>APEL</strike>(OBSOLETE)]]
*[[BDII]]
*[[BDII]]
*[[FTS]]
*[[FTS]]
*[[SL4_x86_64_WNs| SL4 x86_64 WNs]]
*[[SL4_x86_64_WNs| <strike>SL4 x86_64 WNs</strike>(OBSOLETE)]]
*[[CE_oveloaded| CE overloaded]]
*[[CE_oveloaded| CE overloaded]]
*[[RB]]
*[[RB]]
*[[IPMI]]
*[[IPMI]]
*[[CA_certificates| Upgrade CA certificates]]
*[[CA_certificates| <strike>Upgrade CA certificates</strike> (OBSOLETE)]]
*[[Shutdown| Shutting down the cluster]]
*[[Shutdown| Shutting down the cluster]]
*[[Software_Area_Switch| Software Area Switch]]
*[[Software_Area_Switch| Software Area Switch]]
Line 28: Line 30:
*[[Argus| Argus server and glexec on the workernodes]]
*[[Argus| Argus server and glexec on the workernodes]]
*[[ApelGapPublishing| Apel gap publishing]]
*[[ApelGapPublishing| Apel gap publishing]]
*[[UpdateCACertificates| Update IGTF CA certificates]]
==== Files section ====
==== Files section ====
*[[DCache| dCache]]
*[[DCache| dCache]]
Line 39: Line 43:
*[[GetLostFiles| Retrieve lost files from datasets]]
*[[GetLostFiles| Retrieve lost files from datasets]]
*[[StorageConsistency| Storage Consistency]]
*[[StorageConsistency| Storage Consistency]]
*[[Rucio | rucio commands ]]


==== Status and Monitoring ====
==== Status and Monitoring ====
Line 61: Line 66:
*[[NetworkSetup| Network Setup]]
*[[NetworkSetup| Network Setup]]
*[[SetupMonitoringControlerSunfireV20z| Setup Monitoring of LSI Disk Controler on Sunfire V20z Server]]
*[[SetupMonitoringControlerSunfireV20z| Setup Monitoring of LSI Disk Controler on Sunfire V20z Server]]
*[[LDAP_UCL_IIHE| LDAP authentication system for the replication between UCL and IIHE sites]]
*[[LDAP_UCL_IIHE| <strike>LDAP authentication system for the replication between UCL and IIHE sites</strike> (OBSOLETE)]]
*[[GridAdminSurvivalGuide| IIHE Grid-admin survival guide]]
*[[GridAdminSurvivalGuide| IIHE Grid-admin survival guide]]
*[[Solaris| Solaris 10]]
*[[Solaris| Solaris 10]]
*[[SolarisSSD| Adding an SSD card and configuring RAID, zpools, filesystems and shares on the new Solaris fileserver]]
*[[SolarisSSD| Adding an SSD card and configuring RAID, zpools, filesystems and shares on the new Solaris fileserver]]
*[[LinuxAdminTricks| Linux tricks for admins]]
*[[LinuxAdminTricks| Linux tricks for admins]]
*[[CrabLocalPbsSubmission| How to implement local PBS submission with CRAB ?]]
*[[CrabLocalPbsSubmission| <strike>How to implement local PBS submission with CRAB ?</strike>(OBSOLETE)]]
*[[AddNewUserFromUCLToLDAP| How to create an account for a CMS user from UCL ?]]
*[[AddNewUserFromUCLToLDAP| <strike>How to create an account for a CMS user from UCL ?</strike>(OBSOLETE)]]
*[[OSErrata| Deploying OS errata]]
*[[OSErrata| <strike>Deploying OS errata</strike>(OBSOLETE)]]
*[[BenchmarkHEPSPEC06| Howto benchmark a node with HEPSPEC06]]
*[[BenchmarkHEPSPEC06| Howto benchmark a node with HEPSPEC06]]
*[[Installing_dcache_pool| Install a new Dcache pool]]
*[[Installing_dcache_pool| Install a new dCache pool]]
*[[BackupUsersHomeDirs| Backup of the users home dirs on Jefke]]
*[[BackupUsersHomeDirs| Backup of the users home dirs on Jefke]]
*[[MonWebServicesMigration| Migration of mon and its Web services]]
*[[MonWebServicesMigration| Migration of mon and its Web services]]
Line 79: Line 84:
**[[KernelUpdate| Reboot after a kernel update]]
**[[KernelUpdate| Reboot after a kernel update]]
**[[UpgradeWNstoSL5.5| Reboot after an OS upgrade]]
**[[UpgradeWNstoSL5.5| Reboot after an OS upgrade]]
*[[ManageAllAdminScriptsWithSVN| Central management of all the admin scripts with SVN]]
** Force reboot a WN with hanging nfs: echo 1 > /proc/sys/kernel/sysrq ;echo b > /proc/sysrq-trigger
** Force shutdown a WN with hanging nfs: echo 1 > /proc/sys/kernel/sysrq ;echo o > /proc/sysrq-trigger
*[[ManageAllAdminScriptsWithGit| Central management of all the admin scripts with Git]]
*[[HelpPageForAllScripts|Help page for all iihe scripts]]
*[[ConfigProxyCvmfs| Configuration of a proxy for CVMFS]]
*[[ConfigProxyCvmfs| Configuration of a proxy for CVMFS]]
**[[RecoverCvmfs| How to recover CVMFS]]
**[[RecoverCvmfs| How to recover CVMFS]]
Line 90: Line 98:
*[[CCMWithKerberos| Experimental : Securing profiles with Kerberos]]
*[[CCMWithKerberos| Experimental : Securing profiles with Kerberos]]
*[[MigrateToMediaWiki| Migration of T2B Wiki from Trac to MediaWiki]]
*[[MigrateToMediaWiki| Migration of T2B Wiki from Trac to MediaWiki]]
*[[motd|Message Of The Day (motd)]]
*[[LToS| Support of Long-tail of Science]]
*[[QueryingBDII| Querying BDII]]
*[[MachinePrivateCertWithEL7| Machine private certificate with EL7]]
*[[ClusterUsageAccountingStatistics| Cluster usage accounting statistics]]
*[[SingularityContainerCreation | Singularity container creation]]
*[[ExplainingApel | Explaining the Apel accounting system]]


==== Quattor ====
==== Quattor ====
*[http://quattor.begrid.be/trac/centralised-begrid-v5/wiki/BEgridAndQuattor BEgrid wiki]
 
*[[Test_things| Test things]]
*[[AideMemoire| FAQ - Aide-mémoire - Howtos]]
*[[Lemon_installation| Lemon installation]]
*[[ManageRepositoriesWithQuattor|Manage repositories with quattor]]
*[[QuattorPointers| Pointers]] to more in-depth information on quattor
*[[AddingMachineToCluster| Adding]] a new machine to the cluster
*[[AutomaticMachineTemplateGeneration| Automatic generation of hardware and profile templates for new workernodes]]
*[[InstallationBEgridClient0| Installation of a Quattor deployment server release 13.1]]
*[[InstallFilesNewOS| How to add a new OS to the Quattor Repository]]
*[[GenerateRPMFromATagInGithub| How to build an RPM from a tag in Github]]
*[[GenerateRPMFromATagInGithub| How to build an RPM from a tag in Github]]
*[[HowtoMigrateWNToCB9| How to migrate workernodes from CB8 to CB9]]
*[[WorkingInCB9| Working in CB9 (Quattor release >= 14.2)]]
*[[WorkingInCB9| Working in CB9 (Quattor release >= 14.2)]]
*[[AideMemoire| FAQ - Aide-mémoire - Howtos]]
*[[AddNewQuattorVersion|How to add a new version of quattor in our scdb]]
*[[BuildANewPysvnOnAiiServer| Howto build a new pysvn on a SL63 AII server]]
*[[QuattorFreeIPA| Quattor and FreeIPA]]
*[[QuattorFreeIPA| Quattor and FreeIPA]]
*[[NewRuncheck| Rewrite of the runcheck script in Perl]]
*[[HardDisksManagement| Hard disks management]]
*[[HardDisksManagement| Hard disks management]]
*[[Metaconfig|How to use metaconfig (with examples)]]
*[[Aquilon| Aquilon]]
*[http://quattor.begrid.be/trac/centralised-begrid-v5/wiki/BEgridAndQuattor <strike>BEgrid wiki</strike>(OBSOLETE)]
*[[Test_things| <strike>Test things</strike>(OBSOLETE)]]
*[[Lemon_installation| <strike>Lemon installation</strike>(OBSOLETE)]]
*[[QuattorPointers| <strike>Pointers</strike>]]<strike> to more in-depth information on quattor</strike>(OBSOLETE)
*[[AddingMachineToCluster| <strike>Adding</strike>]]<strike> a new machine to the cluster</strike>(OBSOLETE)
*[[AutomaticMachineTemplateGeneration|<strike>Automatic generation of hardware and profile templates for new workernodes</strike>]](OBSOLETE: use script create_wn)
*[[InstallationBEgridClient0|<strike>Installation of a Quattor deployment server release 13.1</strike>]](OBSOLETE: see quattor template for aii server)
*[[InstallFilesNewOS|<strike> How to add a new OS to the Quattor Repository</strike>]](OBSOLETE)
*[[HowtoMigrateWNToCB9|<strike>How to migrate workernodes from CB8 to CB9</strike>]](HISTORICAL)
*[[BuildANewPysvnOnAiiServer|<strike>Howto build a new pysvn on a SL63 AII server</strike>]](HISTORICAL)
==== FreeIPA ====
*[[FixIPAcert|Fix IPA client certificates]]


==== KVM virtualization ====
==== KVM virtualization ====
Line 118: Line 142:
*[[MigrationToOpenNebula| Transforming the KVM hypervisors farm into an OpenNebula cloud]]
*[[MigrationToOpenNebula| Transforming the KVM hypervisors farm into an OpenNebula cloud]]
*[[WorkingInT2BCloud| Working in the T2B cloud]]
*[[WorkingInT2BCloud| Working in the T2B cloud]]
*[[MigrateDBMySQL| Migrate one DB from sqlite to mysql]]
*[[BackupT2BCloud| Backup of the T2B Cloud]]
*[[DealingWithiPXE| Dealing with iPXE]]
*[[ResizingVMDisk| Resizing the drive of a VM]]
*[[RestoringCloudFrontendFromBackup| Restoring an OpenNebula frontend from a backup]]
==== Clouds for users ====
*[[VUB-ULB cloud]]
*[[BEgrid cloud (part of FedCloud)]]


==== gUSE/WS-PGRADE portal ====
==== gUSE/WS-PGRADE portal ====
Line 133: Line 166:


==== CEPH ====
==== CEPH ====
*[[ExperimentsWithCeph| Experiments with CEPH]]
SEE PRIVATE WIKI
 
==== CEPH Old (deprecated) ====
*[[UnderstandingCeph| Understanding Ceph]]
*[[InstallCephWithQuattor| Installing Ceph with Quattor]]
*[[ExperimentsWithCeph| Experiments with Ceph]]
*[[CephBasics| Operating a Ceph cluster]]
*[[Deploying_a_new_Ceph_Octopus_cluster| Deploying a new Ceph Octopus cluster]]
*[[Mounting_a_RBD_on_a_client_machine | Mounting a RBD on a client machine]]
*[[CephCrushMap | Manage the Crush map]]
*[[CephFS | Manage CephFS]]
 
==== Logstash / Elasticsearch / Kibana (ELK) ====
machine: log10 | [http://log10.iihe.ac.be/index.html interface]  |  [http://log10.iihe.ac.be/HQ index manager]
* [[log_forwarding_with_quattor|Forwarding a log with rsyslog to logstash using quattor]]
* [[log_parsing_with_logstash|Parsing the logs with logstash]]
 


==== Network ====
* [[network_bond_and_tag|Bonding of 2 interfaces + tagging of 2 vlans on the bond (PRIV+PUB)|]]
* [[huawei_switch|Managing the Huawei CE8850-32CQ-EI 100G switch]]


{{TracNotice|{{PAGENAME}}}}
==== HTCondor clusters ====
* [[htc_test_local|Testing local submission]]
* [[htc_test_grid|Testing grid submission]]
* [[htc_cheat_sheet|HTCondor cheat sheet]]
* [[htc_python_binding|HTCondor Python binding]]

Latest revision as of 12:09, 25 July 2024

Management of the whole cluster

CMS Services

Grid Configuration Issues

Files section

Status and Monitoring

Info

Quattor


FreeIPA

KVM virtualization

T2B Cloud

Clouds for users

gUSE/WS-PGRADE portal

Migration to EMI-3

XEN

CEPH

SEE PRIVATE WIKI

CEPH Old (deprecated)

Logstash / Elasticsearch / Kibana (ELK)

machine: log10 | interface | index manager


Network

HTCondor clusters