When you get an error like the following this means that MPI_Init is expecting argc/argv. NULL values being passed there are not sufficient. Passing NULL to MPI_Init works on the cluster at ASC though - that is, the program that gives the following error on Olympus runs perfectly on the cluster at ASC.
bash: line 1: 17212 Segmentation fault /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=compute-2-22.local MPIRUN_PORT=60739 MPIRUN_PROCESSES='compute-2-22:compute-2-22:compute-2-1:compute-2-1:compute-2-23:compute-2-23:compute-2-8:compute-2-8:compute-2-28:compute-2-28:compute-2-3:compute-2-3:compute-2-18:compute-2-18:compute-1-23:compute-1-23:compute-1-14:compute-1-14:compute-1-7:compute-1-7:compute-1-9:compute-1-9:compute-1-1:compute-1-1:compute-1-19:compute-1-19:compute-2-15:compute-2-15:compute-1-17:compute-1-17:compute-1-22:compute-1-22:compute-1-13:compute-1-13:compute-2-17:compute-2-17:compute-1-8:compute-1-8:compute-1-29:compute-1-29:compute-1-25:compute-1-25:compute-1-11:compute-1-11:compute-2-14:compute-2-14:compute-1-30:compute-1-30:compute-1-2:compute-1-2:' MPIRUN_RANK=49 MPIRUN_NPROCS=50 MPIRUN_ID=28494 /home/ritu/testcases/life/parallelgen/life 5000 5000 100000 10
Tuesday, April 20, 2010
Wednesday, March 24, 2010
Ant Script Problems
When stuck, check:
1) the version of the jar files that are being used. Make sure they are from a stable build. For example, ant-contrib-1.0b has bugs in it.
2) check the classpath. For example, { classname="FileCopy" } is correct but {
classname="C:/AMMA/Eclipse3.3/FileCopy/bin/FileCopy"} is not.
1) the version of the jar files that are being used. Make sure they are from a stable build. For example, ant-contrib-1.0b has bugs in it.
2) check the classpath. For example, { classname="FileCopy" } is correct but {
classname="C:/AMMA/Eclipse3.3/FileCopy/bin/FileCopy"} is not.
Subscribe to:
Posts (Atom)
