Open
Description
upto 8,320 cpus for 2-5 hours works ok, but less workers for more time may still have GC issues.
Look at
- https://discourse.julialang.org/t/garbage-collection-not-aggressive-enough-on-slurm-cluster
- or lazy addprocs in Memory leak for lazy worker to worker connections JuliaLang/julia#28887
- https://itensor.github.io/ITensors.jl/dev/faq/HPC.html
heap_size_hint
with-exeflags
on workers Interaction betweenaddprocs
and--heap-size-hint
JuliaLang/julia#50673
Metadata
Metadata
Assignees
Labels
No labels