OpenMP issue on TACC

When i run cases with OpenMP on TACC. It gives that:

TACC: Starting up job 3008183
TACC: Starting parallel tasks...
OMP: Warning #52: OMP_SCHEDULE value " 4" is invalid chunk size.
OMP: Info #104: OMP_SCHEDULE value "0" will be used.
OMP: Warning #52: OMP_SCHEDULE value " 4" is invalid chunk size.
OMP: Info #104: OMP_SCHEDULE value "0" will be used.

Do I need to load any module for OpenMP? The submitted script is:

#!/bin/bash
#SBATCH -J mpm_oso        # job name
#SBATCH -o mpm.o   # output and error file name (%j expands to jobID)
#SBATCH -N 2          # number of nodes requested
#SBATCH -n 4              # total number of mpi tasks requested
#SBATCH -p development   # queue (partition) -- normal, development, etc.
#SBATCH -t 2:00:00       # run time (hh:mm:ss) - 18 hours
#SBATCH -A Material-Point-Metho
# Slurm email notifications
#SBATCH --mail-user= user.email@utexas.edu
#SBATCH --mail-type=begin   # email me when the job starts
#SBATCH --mail-type=end     # email me when the job finishes
# run the executable named a.out
module load intel/18.0.2
module load boost hdf5 vtk
module load cray_mpich/7.7.3
export OMP_SCHEDULE="static, 4"
ibrun $SCRATCH/mpm/build/mpm -f ./ -i mpm-hazel_stage1.json

Thanks @yliang-sn for raising this issue.

Should be:

export OMP_SCHEDULE=“static, 4”

No space between static and 4.

Thanks @kks32! I will run it late. LS5 is under system maintenance.

@yliang-sn is this resolved?

It works fine with -N 1 -n 1 or 2. When i tried with -N 2 -n 4. It will fail during the simulation:

[2020-08-27 18:03:28.658] [MPMExplicit] [info] Step: 84 of 1000.

MPM main: map::at
Rank 3 [Thu Aug 27 18:03:36 2020] [c0-0c0s7n2] application called MPI_Abort(MPI_COMM_WORLD, 1) - process 3
aborting job:
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 3
MPM main: map::at
Rank 2 [Thu Aug 27 18:03:36 2020] [c0-0c0s7n2] application called MPI_Abort(MPI_COMM_WORLD, 1) - process 2
aborting job:
application called MPI_Abort(MPI_COMM_WORLD, 1) - process 2
srun: error: nid00030: tasks 2-3: Exited with exit code 255
srun: Terminating job step 3023533.0
slurmstepd: error: *** STEP 3023533.0 ON nid00011 CANCELLED AT 2020-08-27T18:03:41 ***
srun: error: nid00011: tasks 0-1: Terminated
srun: Force Terminated job step 3023533.0
TACC: MPI job exited with code: 143

TACC: Shutdown complete. Exiting.

Thanks @yliang-sn. This is not OpenMP related. Could you open a separate issue and provide input files?

Thanks, @kks32. I will test the same case on the stampede2.

@kks32 I recompiled and It works fine on both ls5 and stampede2 now. Thanks!