log in |
Message boards : Number crunching : short deadlines cause panic mode
Author | Message |
---|---|
Can anything please be done about the extremely short deadlines? They are causing all tasks to run in panic mode, which messes with multiple GPU machines, causing all but one to sit idle. | |
ID: 1720 · Rating: 0 · rate: / Reply Quote | |
Can anything please be done about the extremely short deadlines? They are causing all tasks to run in panic mode, which messes with multiple GPU machines, causing all but one to sit idle. Do you have set some cores free for the GPU? | |
ID: 1721 · Rating: 0 · rate: / Reply Quote | |
Yes, I have the app_config.xml set to reserve a full thread per GPU. he problem is not that the GPUs do not get enough CPU cycles. The problem is with BOINC scheduling. When CPU tasks go into panic mode, they take over all available threads, and only one GPU tasks will run. | |
ID: 1727 · Rating: 0 · rate: / Reply Quote | |
Yes, I have the app_config.xml set to reserve a full thread per GPU. he problem is not that the GPUs do not get enough CPU cycles. The problem is with BOINC scheduling. When CPU tasks go into panic mode, they take over all available threads, and only one GPU tasks will run. You should set your CPU cores in BM to 90%. So you have always 1 core free for GPU. | |
ID: 1728 · Rating: 0 · rate: / Reply Quote | |
That doesn't solve the problem. | |
ID: 1729 · Rating: 0 · rate: / Reply Quote | |
That doesn't solve the problem. ok, but its not working with older clients. | |
ID: 1730 · Rating: 0 · rate: / Reply Quote | |
Agreed. Longer deadlines would solve the problem for everyone. Panic mode causes the BOINC client to do abnormal things. | |
ID: 1731 · Rating: 0 · rate: / Reply Quote | |
Agreed. Longer deadlines would solve the problem for everyone. Panic mode causes the BOINC client to do abnormal things. Only the small bases have a 1 day runtime because if I make it longer and a mix of long and short ones will have a longer waiting time and run out of work. | |
ID: 1732 · Rating: 0 · rate: / Reply Quote | |
Something that would help - but I'm not sure if it can be done so easily : | |
ID: 2177 · Rating: 0 · rate: / Reply Quote | |
Something that would help - but I'm not sure if it can be done so easily : I have reduced the deadline from 6 to 4 days for the long runners. As long as the first WU was sending back and the second was not running by another host the server is sending an abort signal. | |
ID: 2180 · Rating: 0 · rate: / Reply Quote | |
Message boards :
Number crunching :
short deadlines cause panic mode