(WorkerDict pid=2862157) [rank3]:[E923 11:14:11.615370309 ProcessGroupNCCL.cpp:1895] [PG ID 0 PG GUID 0(default_pg) Rank 3] Process group watchdog thread terminated with exception: CUDA error: ...
Hello, I am attempting to run a GPUMD simulation using a fine-tuned NEP model based on nep89_20250409.txt. The model.xyz file was validated and confirmed to be reasonable, and the simulation ...