Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Get random seg faults when starting a job when multiple jobs on a GPU are running #393

Open
bc118 opened this issue Dec 5, 2021 · 0 comments

Comments

@bc118
Copy link
Collaborator

bc118 commented Dec 5, 2021

Describe the bug
GOMC dev code build on 11-27-21 yeilds random (does not happen all the time) segfaults when starting a job when multiple jobs on a GPU are running. Not 100% sure if this is a GOMC or GPU issue.

To Reproduce
Run the code I provide in the GOMC issues slack, as the code here is confidential. You will need manual access to a GPU so it can be potentially double -loaded. I personally loaded/ran them one after each other triple-loading the GPU.

Expected behavior
I would expect the code not to seg fault if a GPU is loaded with more than 1 GOMC simulation

Screenshots
No real info except "Segmentation fault (core dumped)"

Input files
Run the code I provide in the GOMC issues slack, as the code here is confidential.

Please complete the following information:

  • OS: RHEL 8.4
  • Ensemble: GEMC-NPT
  • Code version : GOMC dev code built with the code from 11-27-2021

Additional context
Add any other context about the problem here.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant