CSIRO Linux GPU Cluster
Specialised Graphical Processing Unit computing facility

Getting Started
User Information

Accessing the GPU cluster

To access the test cluster, ssh to the test development/head node, testgpu.csiro.au. Users with an account can log in with their NEXUS ident as the username and their NEXUS password when prompted. If you do not have an account on the GPU cluster, please contact an ASC Cluster Administrator and ask for your NEXUS account to be added (assistance email below).

Logging In

Running Jobs on the GPU cluster

Please note: to submit a job, log on to linuxgpu and run qsub "jobname". Interactive jobs and debugging can be done on the Test, Development and Training system, testgpu.csiro.au.

We will also be setting up a test queue with a small number of test nodes, so that users can test their batch jobs before moving them to the production cluster. Once this is set up, users will be able to submit jobs to the test queue to have them run on a test node.

There is a guide to using the Torque batch system at: http://www.hpsc.csiro.au/userguides/blade/localguide.php#PBS

Using software packages on the GPU Cluster

We have also installed specific software packages, such as the Intel compilers and OpenMPI, in a shared directory called /tools, and set up modules so that users can have their paths and environment configured to use the software installed there. To see which software packages in /tools are available under modules, list the available modules. To load a particular package, load its module. If you wish to compile code using CUDA libraries, load the following
modules:

Storage, file systems, quotas and backups

Currently /home is not backed up, so you must keep your own copy of any files you generate on another file server. When the high performance storage (HNAS) is brought into operation on Thursday 26/11/2009, /home will be backed up.

/data and /flush are intended to provide users with temporary space for use while running jobs. Neither /data nor /flush will be backed up, and /flush will be set up so that files older than 7 days may be automatically deleted to free up space if /flush fills.

All shared file systems will have user quotas implemented on them when the HNAS storage system becomes operational. The following user quotas are set on the shared file systems:

/home 10GB

We have also set a default user quota of 150,000 inodes for each file system, to help protect against runaway jobs. If you need a higher inode quota, please contact us.

Help and Resources

Various resources are available for people wishing to learn more about GPU development:

https://wiki.csiro.au/confluence/display/terabyte/GPU+Forum
http://www.nvidia.com/object/cuda_home.html
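
As an illustration of the job submission step described under Running Jobs, the sketch below writes a minimal Torque job script and submits it with qsub when that command is available. The file name gpu_job.pbs, the job name, the resource request and the walltime are assumptions chosen for illustration, not site defaults; the Torque guide linked in that section is the place to check the values that apply on this cluster.

```shell
#!/bin/sh
# Sketch only: file name, job name, resource request and walltime are
# hypothetical values, not settings taken from the cluster documentation.
job_script="gpu_job.pbs"

# Write the job script. The quoted 'EOF' keeps $PBS_O_WORKDIR and $(hostname)
# unexpanded so they are evaluated when the job runs, not now.
cat > "$job_script" <<'EOF'
#!/bin/sh
#PBS -N example_job
#PBS -l nodes=1:ppn=1
#PBS -l walltime=00:10:00
#PBS -j oe

# Torque starts jobs in the home directory; move to the submission directory.
cd "$PBS_O_WORKDIR"
echo "Running on $(hostname)"
EOF

# Submit only where qsub exists (for example on linuxgpu); otherwise just report.
if command -v qsub >/dev/null 2>&1; then
    qsub "$job_script"
else
    echo "qsub not found; wrote $job_script without submitting"
fi
```

The -j oe directive merges standard error into standard output, which keeps the results of a short test job in a single file.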
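
The module steps described under Using software packages can be sketched as follows, assuming the cluster uses the standard Environment Modules command set (module avail, module load, module list). The function name try_modules and the package name example_package are placeholders for illustration; the actual module names installed in /tools are not given here and must be read from the module avail listing.

```shell
#!/bin/sh
# Sketch only: "example_package" is a placeholder, not a module name from /tools.
try_modules() {
    package="${1:-example_package}"
    if command -v module >/dev/null 2>&1; then
        module avail            # list every module the system provides
        module load "$package"  # set up paths and environment for the package
        module list             # show which modules are now loaded
    else
        # In a plain non-interactive shell the module command may not be initialised.
        echo "module command not available in this shell"
        return 1
    fi
}

try_modules "$@" || true
```

Run interactively on the head node, the same three module commands can simply be typed at the prompt.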
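
The storage rules described under Storage, file systems, quotas and backups (files in /flush older than 7 days may be deleted, and a default quota of 150,000 inodes per file system) can be checked from the shell. This is a rough sketch: the helper names count_inodes and list_stale_files are hypothetical, and a count taken with find only approximates what the quota system itself would report.

```shell
#!/bin/sh
# Count files and directories under a path as an approximation of inode usage.
# The starting directory itself is included in the count.
count_inodes() {
    find "$1" -xdev 2>/dev/null | wc -l | tr -d ' '
}

# List regular files that have not been modified recently. find counts whole
# days, so -mtime +7 matches files last modified at least 8 days ago.
list_stale_files() {
    find "$1" -xdev -type f -mtime +7 2>/dev/null
}

inode_limit=150000          # default inode quota stated in the guide
target="${1:-.}"            # directory to inspect; defaults to the current one
used=$(count_inodes "$target")
echo "$target: about $used of $inode_limit inodes"
if [ "$used" -gt "$inode_limit" ]; then
    echo "warning: over the default inode quota"
fi
```

For example, list_stale_files "/flush/$USER" would list candidates for automatic deletion, assuming each user keeps files in a per-user directory under /flush.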