Moving clusters
From Molecular Modeling Wiki
Warning
Due to the security problems, the SSH server keys will be regenerated on most cluster servers. This means that you can receive a warning similar to the following when trying to log in for the first time:
@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ @ WARNING: REMOTE HOST IDENTIFICATION HAS CHANGED! @ @@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ IT IS POSSIBLE THAT SOMEONE IS DOING SOMETHING NASTY! Someone could be eavesdropping on you right now (man-in-the-middle attack)! It is also possible that the RSA host key has just been changed. The fingerprint for the RSA key sent by the remote host is 9a:a3:68:49:29:f6:a0:f4:c1:64:a0:fd:98:67:b2:67. Please contact your system administrator. Add correct host key in /root/.ssh/known_hosts to get rid of this message. Offending key in /root/.ssh/known_hosts:53 RSA host key for iridium has changed and you have requested strict checking. Host key verification failed.
To get rid of this warning and get access to the cluster, do the following:
- Find a row that starts with Offending key ... in the text of the warning and remember line number after a colon (53 in the above example).
- Open the ~/.ssh/known_hosts file delete the line which number was mentioned in the warning (in vi, press <5><3><shift-G><d><d> in case of the above example; replace numbers according to your needs).
- Save the file and try to connect to the cluster.
- If the connection fails again with the same or similar warning, repeat all steps.
Actual status
- centrum
- Server is up and running.
- carol
- Working on ...
- marge
- ... to be moved.
- teogate
- ... to be moved.
- althea
- Server is up and running.
- lithium
- Cluster is up and running, except for clients l07 and l22-l25 that need some repair.
- helium
- Cluster is up and running.
- francium
- Server is up and running.
- There will be more maintenance work on server, so it can be restarted several times; meanwhile you can access your data.
- The cluster is being checked.
- The nodes will not be started before the infiniband switch is returned from a warranty repair - it can take two or three weeks.
- iridium
- Cluster is up and running. Enable queues as needed.
- cobalt
- Not running.
Server will be turned on on Friday.Sorry, I did not manage.
- argon
- Server is up and running.
- There will be more maintenance work on server, so it can be restarted several times; meanwhile you can access your data.
- The cluster is being checked, I hope to start it later on Monday.
- krypton
- Server is up and running.
- There will be more maintenance work on server, so it can be restarted several times; meanwhile you can access your data.
- The cluster is being checked, some cabling will have to be replaced, so I do not expect to run it on Monday.
- radon
- Not running.
Server will be turned on on Friday.Sorry, I did not manage.
- palladium
- Not running.
Server will be turned on on Friday.Sorry, I did not manage.
- vanad
- Discontinued.
- titanium
- Discontinued.
- niob
- Discontinued.