TWiki
>
DistributedComputing Web
>
GridEngine
(08 May 2008,
IainRae
)
(raw view)
E
dit
A
ttach
<!-- This is a comment, in the HTML sense of the word, but Wiki directives still work, so you could put access control statements here to restrict access to your new topic, and they won't show up in the published page. Please think carefully when creating a new topic, and use a GoodWikiName --> This area is for users to share tips and tricks about using the local Grid Engine setup. Local official documentation for Grid Engine can be found at http://www.inf.ed.ac.uk/systems/beowulf/doc/gridengine.html For cluster specific info see [[hermes]], [[lion]], [[lutzow]] or [[townhill]]. ---+The basics * What clusters are available? * In preparation for moving to the forum the clusters have been re-organised into two * The [[desktop beowulf]] machines (GX240/P530) with head nodes lion, lutzow and moselle * The [[rackmount beowulf]] machines (PE1425) with head nodes hermes, townhill and seville * It's expected that the rackmounted machines will be moved to the forum server room within a couple of months of the move to the forum being finished, the older desktop machines will take longer to move. * GridEngine is configured so that the cluster cores each have the same amount of memory (1Gb per core on all the above clusters). Other configurations would be possible - add this to the wish list if you want it. You can monitor the load on the above Informatics compute resources at http://bwganglia.inf.ed.ac.uk/ganglia/ From outside Edinburgh, you can still see this web page, thus: * ssh -L 8000:bwganglia.inf.ed.ac.uk:80 username@ssh.inf.ed.ac.uk * now point your browser at localhost:8000/ganglia * this has stopped working since the move to FC5 (support ticket 30296) * LocalWishList - list improvements you would like to see * UnansweredQuestions - post a question, or see if you can help someone by answering their questions * AnsweredQuestions - when a question from above is answered, move it here * UsingGMTKUnderGridEngine - documentation for the genetic algorithm based triangulation script ---+Misc Tips and tricks * A sample script calling qsub is /group/project/ami3/amiasr_shef/asrcore/tools/submitjob.hermes a not so nice one /home/vstrom/software/cstr/scripts/multisyn_build/bin/do_alignment_parallel Both do not make use of "submit scripts" since changing the file name for stderr and stdout does not work then, i.e. "qsub -o fname" works, "qsub submit_script" with "#$ -o fname" in submit_script does not. -- Main.VolkerStrom - 13 Dec 2005 * Some generic jobs submission advice * Sanity check your environment *before* you start to run your job. If the output is going to a file in ~/longrunning_jobs/results/temp then check that the directory exists and is writable before starting any computation. * Write intermediate and results files to /disk/scratch on the local filesystem then copy them back to your home directory, if the copy fails you'll still have a set of results that could be retrieved. -- Main.IainRae - 18 Apr 2006
E
dit
|
A
ttach
|
P
rint version
|
H
istory
: r12
<
r11
<
r10
<
r9
<
r8
|
B
acklinks
|
V
iew topic
|
Ra
w
edit
|
M
ore topic actions
Topic revision: r12 - 08 May 2008 - 09:48:43 -
IainRae
DistributedComputing
DistributedComputing Web
DistributedComputing Web Home
Changes
Index
Search
Webs
ANC
Android4EDU
BDEteam
CSTR
CogsciTeach
ComputingStrategy
DICE
DReaM
DSOClets
DatabaseGroup
DistributedComputing
DiyDice
Dizzy
DocsByUsers
ECCOModelling
ECHOES
EyeTracking
F4K
FC3upgrade
Ftlwiki
INFBase
Inf2aWiki
JastProject
LINCS
LTG
MATCH
MLforNLP
Main
MusIC
PG
Prism
RaviProgress
SDP
SDPGroup1
SDPGroup10
SDPGroup3
SDPGroup5
SDPGroup6
SDPGroup7
SDPGroup8
SDPGroup9
SEOC
Sandbox
SecurityProgramme
SelfManaged
Seminars
StudioLab
TFlex
TWiki
TheBeast
Ug30708
Vademecum
VerbClasses
WebExp
YouTute
My links
My home page
edit
Copyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki?
Send feedback
This Wiki uses
Cookies