Posts by UBT - Timbo
1) Message boards : Number crunching : Something has changed in SRBase (Message 8152)
Posted 21 Mar 2022 by UBT - Timbo
Historically, the completion time for Base, Short and Average tasks was typically about 2 hours. The Task Progress increased at a consistent rate from 0% to 100%.

Now, the Task Progress is progressing from 0% to 90% in about 2 hours, then it is taking an additional 6 - 8 hours to progress from 90% to 100%. This is occurring on my multiple computers.


I've had exactly the same issue on a number of "Riesel Base - short v0.22" CPU tasks, which I received during the recent Formula BOINC Sprint.

Many of these tasks did complete, but I had a number that got stuck at 100% after 8 hours...and this was on a 4-core Windows x64 PC.

Progress was so slow that I had to abort many tasks before their deadline, as I knew the PCs would not get through the remaining tasks in time...even though my "cache" in BOINC Manager was set to 0.5+0.5 days...

SRBase seems to be allowing too many tasks to download, because it is using an inaccurate "Time to completion" estimate for each task: estimates were around 24 minutes, but the actual time is between 9,000 and 14,000 seconds (2.5 to 3.9 hours).
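
As a rough illustration of why a low estimate over-fills the cache, here is a minimal sketch of the arithmetic. It assumes a deliberately simplified model that is not BOINC's actual work-fetch logic: the client keeps downloading until the estimated runtimes fill the cache, all 4 cores run only SRBase, and nothing else (resource share, other projects) is taken into account. The function names are made up for this example.

```python
SECONDS_PER_DAY = 86_400

def tasks_to_fill_cache(cache_days, cores, estimated_task_s):
    """Tasks needed to fill the cache if every task took its estimated time."""
    return int(cache_days * SECONDS_PER_DAY * cores // estimated_task_s)

def actual_days_of_work(n_tasks, cores, actual_task_s):
    """Days needed to clear n_tasks at the real per-task runtime."""
    return n_tasks * actual_task_s / cores / SECONDS_PER_DAY

cache_days = 0.5 + 0.5      # cache setting quoted in the post
cores = 4                   # assumption: all 4 cores crunch SRBase
estimated_s = 24 * 60       # ~24 minute estimate quoted in the post

n_tasks = tasks_to_fill_cache(cache_days, cores, estimated_s)
low_days = actual_days_of_work(n_tasks, cores, 9_000)
high_days = actual_days_of_work(n_tasks, cores, 14_000)

print(f"tasks fetched for a {cache_days:.1f}-day cache: {n_tasks}")
print(f"real work queued: {low_days:.2f} to {high_days:.2f} days")
```

Under those assumptions, a cache meant to hold one day of work ends up holding several days of it, which fits with having to abort tasks before the deadline.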


Error messages for 3 of these tasks:


Name R703_250-300k_wu_19075_0
Workunit 70607562

after 29,238.85 seconds

Stderr output
<core_client_version>7.16.20</core_client_version>
<![CDATA[
<message>
aborted by user</message>
<stderr_txt>
14:42:10 (8828): wrapper (7.5.26012): starting
14:42:10 (8828): wrapper: running llr.exe ( -d -oPgenInputFile=input.prp -oPgenOutputFile=primes.txt -oDiskWriteTime=10 -oOutputIterations=50000 -oResultsFileIterations=99999999)
Base factorized as : 19*37
Base prime factor(s) taken : 37
Starting N+1 prime test of 1902*703^269668-1
Using zero-padded FFT length 448K, Pass1=448, Pass2=1K, a = 3

1902*703^269668-1, bit: 50000 / 2550364 [1.96%]. Time per bit: 14.086 ms.
1902*703^269668-1, bit: 100000 / 2550364 [3.92%]. Time per bit: 14.207 ms.
1902*703^269668-1, bit: 150000 / 2550364 [5.88%]. Time per bit: 10.218 ms.
1902*703^269668-1, bit: 200000 / 2550364 [7.84%]. Time per bit: 14.059 ms.
1902*703^269668-1, bit: 250000 / 2550364 [9.80%]. Time per bit: 13.267 ms.
1902*703^269668-1, bit: 300000 / 2550364 [11.76%]. Time per bit: 11.433 ms.
1902*703^269668-1, bit: 350000 / 2550364 [13.72%]. Time per bit: 14.868 ms.
1902*703^269668-1, bit: 400000 / 2550364 [15.68%]. Time per bit: 12.796 ms.
1902*703^269668-1, bit: 450000 / 2550364 [17.64%]. Time per bit: 10.504 ms.
1902*703^269668-1, bit: 500000 / 2550364 [19.60%]. Time per bit: 12.821 ms.
1902*703^269668-1, bit: 550000 / 2550364 [21.56%]. Time per bit: 11.780 ms.
1902*703^269668-1, bit: 600000 / 2550364 [23.52%]. Time per bit: 13.059 ms.
1902*703^269668-1, bit: 650000 / 2550364 [25.48%]. Time per bit: 11.453 ms.
1902*703^269668-1, bit: 700000 / 2550364 [27.44%]. Time per bit: 12.440 ms.
1902*703^269668-1, bit: 750000 / 2550364 [29.40%]. Time per bit: 13.753 ms.
1902*703^269668-1, bit: 800000 / 2550364 [31.36%]. Time per bit: 14.325 ms.
1902*703^269668-1, bit: 850000 / 2550364 [33.32%]. Time per bit: 16.226 ms.
1902*703^269668-1, bit: 900000 / 2550364 [35.28%]. Time per bit: 9.337 ms.
1902*703^269668-1, bit: 950000 / 2550364 [37.24%]. Time per bit: 10.637 ms.
1902*703^269668-1, bit: 1000000 / 2550364 [39.21%]. Time per bit: 9.782 ms.
1902*703^269668-1, bit: 1050000 / 2550364 [41.17%]. Time per bit: 15.136 ms.
1902*703^269668-1, bit: 1100000 / 2550364 [43.13%]. Time per bit: 11.879 ms.
1902*703^269668-1, bit: 1150000 / 2550364 [45.09%]. Time per bit: 11.653 ms.
1902*703^269668-1, bit: 1200000 / 2550364 [47.05%]. Time per bit: 8.478 ms.
1902*703^269668-1, bit: 1250000 / 2550364 [49.01%]. Time per bit: 8.418 ms.
1902*703^269668-1, bit: 1300000 / 2550364 [50.97%]. Time per bit: 11.392 ms.
1902*703^269668-1, bit: 1350000 / 2550364 [52.93%]. Time per bit: 13.555 ms.
1902*703^269668-1, bit: 1400000 / 2550364 [54.89%]. Time per bit: 10.468 ms.
1902*703^269668-1, bit: 1450000 / 2550364 [56.85%]. Time per bit: 14.186 ms.
1902*703^269668-1, bit: 1500000 / 2550364 [58.81%]. Time per bit: 9.281 ms.
1902*703^269668-1, bit: 1550000 / 2550364 [60.77%]. Time per bit: 12.545 ms.
1902*703^269668-1, bit: 1600000 / 2550364 [62.73%]. Time per bit: 13.959 ms.
1902*703^269668-1, bit: 1650000 / 2550364 [64.69%]. Time per bit: 19.157 ms.
1902*703^269668-1, bit: 1700000 / 2550364 [66.65%]. Time per bit: 11.429 ms.
1902*703^269668-1, bit: 1750000 / 2550364 [68.61%]. Time per bit: 15.054 ms.
1902*703^269668-1, bit: 1800000 / 2550364 [70.57%]. Time per bit: 13.140 ms.
1902*703^269668-1, bit: 1850000 / 2550364 [72.53%]. Time per bit: 10.148 ms.
1902*703^269668-1, bit: 1900000 / 2550364 [74.49%]. Time per bit: 9.431 ms.
1902*703^269668-1, bit: 1950000 / 2550364 [76.45%]. Time per bit: 10.708 ms.
1902*703^269668-1, bit: 2000000 / 2550364 [78.42%]. Time per bit: 9.843 ms.
1902*703^269668-1, bit: 2050000 / 2550364 [80.38%]. Time per bit: 12.836 ms.
1902*703^269668-1, bit: 2100000 / 2550364 [82.34%]. Time per bit: 13.041 ms.
1902*703^269668-1, bit: 2150000 / 2550364 [84.30%]. Time per bit: 12.277 ms.
1902*703^269668-1, bit: 2200000 / 2550364 [86.26%]. Time per bit: 12.989 ms.
1902*703^269668-1, bit: 2250000 / 2550364 [88.22%]. Time per bit: 15.173 ms.
1902*703^269668-1, bit: 2300000 / 2550364 [90.18%]. Time per bit: 11.902 ms.
1902*703^269668-1, bit: 2350000 / 2550364 [92.14%]. Time per bit: 10.549 ms.
</stderr_txt>
]]>




Name R703_250-300k_wu_19078_0
Workunit 70607565

after 31,919.33 seconds


Stderr output
<core_client_version>7.16.20</core_client_version>
<![CDATA[
<message>
aborted by user</message>
<stderr_txt>
13:54:16 (1584): wrapper (7.5.26012): starting
13:54:16 (1584): wrapper: running llr.exe ( -d -oPgenInputFile=input.prp -oPgenOutputFile=primes.txt -oDiskWriteTime=10 -oOutputIterations=50000 -oResultsFileIterations=99999999)
Base factorized as : 19*37
Base prime factor(s) taken : 37
Starting N+1 prime test of 3422*703^269687-1
Using zero-padded FFT length 448K, Pass1=448, Pass2=1K, a = 3

3422*703^269687-1, bit: 50000 / 2550545 [1.96%]. Time per bit: 21.082 ms.
3422*703^269687-1, bit: 100000 / 2550545 [3.92%]. Time per bit: 14.029 ms.
3422*703^269687-1, bit: 150000 / 2550545 [5.88%]. Time per bit: 20.408 ms.
3422*703^269687-1, bit: 200000 / 2550545 [7.84%]. Time per bit: 17.811 ms.
3422*703^269687-1, bit: 250000 / 2550545 [9.80%]. Time per bit: 17.890 ms.
3422*703^269687-1, bit: 300000 / 2550545 [11.76%]. Time per bit: 18.484 ms.
3422*703^269687-1, bit: 350000 / 2550545 [13.72%]. Time per bit: 18.293 ms.
3422*703^269687-1, bit: 400000 / 2550545 [15.68%]. Time per bit: 17.964 ms.
3422*703^269687-1, bit: 450000 / 2550545 [17.64%]. Time per bit: 17.240 ms.
3422*703^269687-1, bit: 500000 / 2550545 [19.60%]. Time per bit: 17.282 ms.
3422*703^269687-1, bit: 550000 / 2550545 [21.56%]. Time per bit: 15.499 ms.
3422*703^269687-1, bit: 600000 / 2550545 [23.52%]. Time per bit: 13.245 ms.
3422*703^269687-1, bit: 650000 / 2550545 [25.48%]. Time per bit: 16.008 ms.
3422*703^269687-1, bit: 700000 / 2550545 [27.44%]. Time per bit: 17.417 ms.
3422*703^269687-1, bit: 750000 / 2550545 [29.40%]. Time per bit: 13.814 ms.
3422*703^269687-1, bit: 800000 / 2550545 [31.36%]. Time per bit: 18.525 ms.
3422*703^269687-1, bit: 850000 / 2550545 [33.32%]. Time per bit: 18.470 ms.
3422*703^269687-1, bit: 900000 / 2550545 [35.28%]. Time per bit: 24.651 ms.
3422*703^269687-1, bit: 950000 / 2550545 [37.24%]. Time per bit: 17.765 ms.
3422*703^269687-1, bit: 1000000 / 2550545 [39.20%]. Time per bit: 18.780 ms.
3422*703^269687-1, bit: 1050000 / 2550545 [41.16%]. Time per bit: 25.783 ms.
3422*703^269687-1, bit: 1100000 / 2550545 [43.12%]. Time per bit: 19.863 ms.
3422*703^269687-1, bit: 1150000 / 2550545 [45.08%]. Time per bit: 13.764 ms.
3422*703^269687-1, bit: 1200000 / 2550545 [47.04%]. Time per bit: 14.873 ms.
3422*703^269687-1, bit: 1250000 / 2550545 [49.00%]. Time per bit: 16.562 ms.
3422*703^269687-1, bit: 1300000 / 2550545 [50.96%]. Time per bit: 15.547 ms.
3422*703^269687-1, bit: 1350000 / 2550545 [52.92%]. Time per bit: 19.572 ms.
3422*703^269687-1, bit: 1400000 / 2550545 [54.89%]. Time per bit: 17.301 ms.
3422*703^269687-1, bit: 1450000 / 2550545 [56.85%]. Time per bit: 18.368 ms.
3422*703^269687-1, bit: 1500000 / 2550545 [58.81%]. Time per bit: 18.188 ms.
3422*703^269687-1, bit: 1550000 / 2550545 [60.77%]. Time per bit: 17.583 ms.
3422*703^269687-1, bit: 1600000 / 2550545 [62.73%]. Time per bit: 25.763 ms.
3422*703^269687-1, bit: 1650000 / 2550545 [64.69%]. Time per bit: 19.545 ms.
3422*703^269687-1, bit: 1700000 / 2550545 [66.65%]. Time per bit: 19.310 ms.
3422*703^269687-1, bit: 1750000 / 2550545 [68.61%]. Time per bit: 14.516 ms.
</stderr_txt>
]]>




Name R703_250-300k_wu_18879_0
Workunit 70607366

after 28,989.31 seconds (8+ hours)

[quote]Stderr output
<core_client_version>7.16.20</core_client_version>
<![CDATA[
<message>
aborted by user</message>
<stderr_txt>
14:46:25 (8936): wrapper (7.5.26012): starting
14:46:25 (8936): wrapper: running llr.exe ( -d -oPgenInputFile=input.prp -oPgenOutputFile=primes.txt -oDiskWriteTime=10 -oOutputIterations=50000 -oResultsFileIterations=99999999)
Base factorized as : 19*37
Base prime factor(s) taken : 37
Starting N+1 prime test of 494*703^268972-1
Using zero-padded FFT length 448K, Pass1=448, Pass2=1K, a = 3

494*703^268972-1, bit: 50000 / 2543780 [1.96%]. Time per bit: 15.066 ms.
494*703^268972-1, bit: 100000 / 2543780 [3.93%]. Time per bit: 16.988 ms.
494*703^268972-1, bit: 150000 / 2543780 [5.89%]. Time per bit: 13.458 ms.
494*703^268972-1, bit: 200000 / 2543780 [7.86%]. Time per bit: 15.213 ms.
494*703^268972-1, bit: 250000 / 2543780 [9.82%]. Time per bit: 11.652 ms.
494*703^268972-1, bit: 300000 / 2543780 [11.79%]. Time per bit: 14.904 ms.
494*703^268972-1, bit: 350000 / 2543780 [13.75%]. Time per bit: 11.022 ms.
494*703^268972-1, bit: 400000 / 2543780 [15.72%]. Time per bit: 12.506 ms.
494*703^268972-1, bit: 450000 / 2543780 [17.69%]. Time per bit: 11.886 ms.
494*703^268972-1, bit: 500000 / 2543780 [19.65%]. Time per bit: 12.464 ms.
494*703^268972-1, bit: 550000 / 2543780 [21.62%]. Time per bit: 16.945 ms.
494*703^268972-1, bit: 600000 / 2543780 [23.58%]. Time per bit: 15.245 ms.
494*703^268972-1, bit: 650000 / 2543780 [25.55%]. Time per bit: 11.889 ms.
494*703^268972-1, bit: 700000 / 2543780 [27.51%]. Time per bit: 14.020 ms.
494*703^268972-1, bit: 750000 / 2543780 [29.48%]. Time per bit: 17.374 ms.
494*703^268972-1, bit: 800000 / 2543780 [31.44%]. Time per bit: 23.795 ms.
494*703^268972-1, bit: 850000 / 2543780 [33.41%]. Time per bit: 18.197 ms.
494*703^268972-1, bit: 900000 / 2543780 [35.38%]. Time per bit: 14.267 ms.
494*703^268972-1, bit: 950000 / 2543780 [37.34%]. Time per bit: 24.296 ms.
494*703^268972-1, bit: 1000000 / 2543780 [39.31%]. Time per bit: 19.856 ms.
494*703^268972-1, bit: 1050000 / 2543780 [41.27%]. Time per bit: 9.323 ms.
494*703^268972-1, bit: 1100000 / 2543780 [43.24%]. Time per bit: 14.670 ms.
494*703^268972-1, bit: 1150000 / 2543780 [45.20%]. Time per bit: 17.432 ms.
494*703^268972-1, bit: 1200000 / 2543780 [47.17%]. Time per bit: 20.501 ms.
494*703^268972-1, bit: 1250000 / 2543780 [49.13%]. Time per bit: 15.702 ms.
494*703^268972-1, bit: 1300000 / 2543780 [51.10%]. Time per bit: 12.189 ms.
494*703^268972-1, bit: 1350000 / 2543780 [53.07%]. Time per bit: 9.402 ms.
494*703^268972-1, bit: 1400000 / 2543780 [55.03%]. Time per bit: 16.164 ms.
494*703^268972-1, bit: 1450000 / 2543780 [57.00%]. Time per bit: 15.948 ms.
494*703^268972-1, bit: 1500000 / 2543780 [58.96%]. Time per bit: 12.368 ms.
494*703^268972-1, bit: 1550000 / 2543780 [60.93%]. Time per bit: 14.206 ms.
494*703^268972-1, bit: 1600000 / 2543780 [62.89%]. Time per bit: 12.777 ms.
494*703^268972-1, bit: 1650000 / 2543780 [64.86%]. Time per bit: 16.550 ms.
494*703^268972-1, bit: 1700000 / 2543780 [66.82%]. Time per bit: 19.832 ms.
494*703^268972-1, bit: 1750000 / 2543780 [68.79%]. Time per bit: 22.443 ms.
494*703^268972-1, bit: 1800000 / 2543780 [70.76%]. Time per bit: 20.650 ms.
494*703^268972-1, bit: 1850000 / 2543780 [72.72%]. Time per bit: 14.442 ms.
</stderr_txt>
]]>
[/quote]


In all 3 cases, the stderr.txt files (in the slot directories) were still being written to until just before the task was aborted...which implies the tasks were still being crunched...
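
The progress lines in those logs can also be used to sanity-check how long a task should take in total. The following is a minimal sketch, assuming each "Time per bit" figure is representative of the interval it was reported for and that a plain average of them is a fair guide to the whole run; the function name and the returned keys are made up for this example.

```python
import re

# Matches LLR progress lines such as:
# 1902*703^269668-1, bit: 50000 / 2550364 [1.96%]. Time per bit: 14.086 ms.
PROGRESS = re.compile(
    r"bit: (\d+) / (\d+) \[[\d.]+%\]\. Time per bit: ([\d.]+) ms\."
)

def estimate_runtime(log_text):
    """Estimate total and remaining runtime from LLR progress lines."""
    samples = [
        (int(bit), int(total), float(ms))
        for bit, total, ms in PROGRESS.findall(log_text)
    ]
    if not samples:
        raise ValueError("no LLR progress lines found")
    last_bit, total_bits, _ = samples[-1]
    # Assumption: the plain mean of the reported figures stands in for the run.
    mean_ms = sum(ms for _, _, ms in samples) / len(samples)
    return {
        "done_fraction": last_bit / total_bits,
        "estimated_total_s": total_bits * mean_ms / 1000,
        "estimated_remaining_s": (total_bits - last_bit) * mean_ms / 1000,
    }

# Last two progress lines of the first task above.
sample = """\
1902*703^269668-1, bit: 2300000 / 2550364 [90.18%]. Time per bit: 11.902 ms.
1902*703^269668-1, bit: 2350000 / 2550364 [92.14%]. Time per bit: 10.549 ms.
"""
result = estimate_runtime(sample)
print(result)
```

With roughly 2.55 million bits at 10 to 15 ms each, the estimate comes out at several hours of run time, far more than the 24 minute estimate shown by the client.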


regards
Tim
2) Message boards : Number crunching : Stuck at 100% (Message 1454)
Posted 21 May 2015 by UBT - Timbo
Then another program has grabbed all the power.


Hi Reb,

Unfortunately, that's not the case - this particular PC *only* crunches BOINC - no other programs run on it.

So, there must be another answer?

I'll see what happens when the next tasks download...thus far the short WUs are being crunched OK, and until the other day, the average WUs were OK too...

regards
Tim
3) Message boards : Number crunching : Stuck at 100% (Message 1448)
Posted 21 May 2015 by UBT - Timbo
The GPU is the problem. It needs a full CPU core so it can slow down the calculations. You don't need more RAM, a WU is taking around 65MB. If you run more than one project check the RAM usage, BU can take a lot.


Hi Reb,

But I set the BOINC Manager preferences to only use 75% of the CPUs available. So, only 3 are used...not 4. That means the GPU has full access to one CPU core, and I can then run some CPU tasks, plus some ASICs.

I'll reduce the CPU preference to (say) 30% and see if any more SRBase tasks complete OK.
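
For reference, here is a small sketch of how a CPU percentage maps to whole cores on a 4-core machine. It assumes the client rounds the result down and never goes below one core; that rounding rule is an assumption for illustration, not something confirmed in this thread, and the function name is made up.

```python
def usable_cpus(total_cpus, max_pct):
    """Whole cores available for CPU tasks (assumed: round down, minimum 1)."""
    return max(1, int(total_cpus * max_pct / 100))

for pct in (75, 30):
    print(f"{pct}% of 4 cores -> {usable_cpus(4, pct)} core(s) for CPU tasks")
```

Under that assumption, 75% leaves one core free for the GPU task, and 30% leaves three.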

regards
Tim
4) Message boards : Number crunching : Stuck at 100% (Message 1441)
Posted 19 May 2015 by UBT - Timbo
finish file present too long

The computer was busy and had no free resources to write the result file, so it errored out.


Hi Reb,

Thanks for your reply.

OK, so this is a quad-core Intel-CPU PC, running Win 7 (x64) with 2 GB RAM and 400 GB of free space on the HDD. Task Manager shows lots of spare CPU capacity and very little paging to the hard drive.

Otherwise, it only runs BU (via ASICs) and WU-Prop - that's it - it's a dedicated BOINC machine...with 3 cores dedicated to CPU tasks. Plus, there's a GPU running Moo.

How can it need more "resources" than this, to give to SRBase? Do I need more RAM, because that's the only thing that I can do to "upgrade" this PC?

regards
Tim
5) Message boards : Number crunching : Stuck at 100% (Message 1439)
Posted 19 May 2015 by UBT - Timbo
Application Sierpinski/Riesel Base - long 0.04. Latest BOINC Manager 7.4.42 (x64), running on Windows 7.

Four tasks have been at 100% for the past 12 hours or so. Screenshot attached. Should I worry, should I let them run, should I abort?

Thanks.

http://i.imgur.com/rrr2bbo.png


Hi all,

I just had this same issue - 2 WUs at 100% and "Waiting to run"...

In the meantime other project tasks started...so, when I saw this I suspended these other tasks and BM re-started the SRBase WUs...but within seconds, both WUs were "Computation error"...so that's 4+ hours wasted.

http://srbase.myfirewall.org/sr5/result.php?resultid=34818320
http://srbase.myfirewall.org/sr5/result.php?resultid=34818184

These are both "Sierpinski / Riesel Base - average v0.04" WUs.

I haven't had this issue before, so do not know the reason - PC has been working fine before now:

http://srbase.myfirewall.org/sr5/results.php?userid=327&offset=0&show_names=0&state=4&appid=13

regards
Tim






Copyright © 2014-2024 BOINC Confederation / rebirther