Question: causes to get 'core' files

Message

modzy · #1 Unread post by **modzy** » Tue Mar 04, 2014 2:38 pm

I did a successful mpi parallel, barotropic model run with a setup of a contant, say 500 m, bathymetry over the entire grid. I then doubled the number of vertical layers, say from the original 20 to 40 layers, and ran the model with reduced time step. This time after about two days of simulation, the run blew up which created a lot of cores files. However, it didn't save any 'nan' data into the perfect restart file as ROMS udually does in case of a blowing up due to nan. What might be the cause of the blowing up? Thanks!

kate · #2 Unread post by **kate** » Wed Mar 05, 2014 10:53 pm

ROMS is checking for unphysically large values for rho and velocity, as well as checking for things like NaN. It will report, then kill the job if it finds a problem.

Like ROMS, the Unix/Linux kernel is checking for certain conditions. If it finds an ill-behaved program it will kill it and generate a core file (or many). Many people find these core files unhelpful and limit their size through the coredumpsize limit. Instead, you can recompile your code with the USE_DEBUG switch and potentially find out where in ROMS your job died by using a debugger on the core file. There's more than one way your program can by ill-behaved from the point of view of the kernel and I don't know which off hand.

Ocean Modeling Discussion

Question: causes to get 'core' files

Question: causes to get 'core' files

Re: Question: causes to get 'core' files