Long 2 are a joke
log in

Advanced search

Message boards : Number crunching : Long 2 are a joke

Previous · 1 · 2 · 3
Author Message
Shadak
Send message
Joined: 29 Nov 18
Posts: 5
Credit: 629,213
RAC: 0
Message 5148 - Posted: 25 Apr 2019, 14:27:43 UTC - in response to Message 5142.
Last modified: 25 Apr 2019, 14:28:00 UTC

so. all my long2 are now marked for resending...I'm 2-3 days away from finishing...


it seems like I'm lucky. my longs didn't got send to someone else and my WUs are between 97% and 98%. they will be done today.
____________

Profile JStateson
Send message
Joined: 30 Apr 19
Posts: 6
Credit: 182,390
RAC: 0
Message 5165 - Posted: 3 May 2019, 3:58:58 UTC

I had five long2 running for just over 2 days on my i9-7900X with 5 hours remaining. After reading about allowing "mt" I picked up that app_config.xml and put it into my srbase project directory. I then requested a "read" of the xml. OK, it was read but no extra CPUs were assigned. Still using 1 cpu each and I got 20 cpu threads total.

I then surmised that I had to stop and start boinc to make it happen. That did not work either. In fact, it made things MUCH WORSE. It seems there is no checkpointing going on. Those 5 tasks that had run for over 2 days started over at "0". They now show 35 minutes elapsed and 20 hours remaining instead of 2+ days elapsed and 5 hours remaining.

As soon as I post this I will abort them and go back to number fields.

Profile PDW
Send message
Joined: 15 Oct 15
Posts: 41
Credit: 696,427,546
RAC: 474,985
Message 5166 - Posted: 3 May 2019, 6:12:59 UTC - in response to Message 5165.

I had five long2 running for just over 2 days on my i9-7900X with 5 hours remaining. After reading about allowing "mt" I picked up that app_config.xml and put it into my srbase project directory. I then requested a "read" of the xml. OK, it was read but no extra CPUs were assigned. Still using 1 cpu each and I got 20 cpu threads total.

I then surmised that I had to stop and start boinc to make it happen. That did not work either. In fact, it made things MUCH WORSE. It seems there is no checkpointing going on. Those 5 tasks that had run for over 2 days started over at "0". They now show 35 minutes elapsed and 20 hours remaining instead of 2+ days elapsed and 5 hours remaining.

As soon as I post this I will abort them and go back to number fields.

Well that was a waste, looking at the log file clearly shows that checkpoints were happening, in fact they were all over 80% done, the highest was at 83.98% when you aborted it...

Resuming N+1 prime test of 3656*22^4632494-1 at bit 17147137 [83.00%] Using FMA3 FFT length 2016K, Pass1=448, Pass2=4608, a = 3 3656*22^4632494-1, bit: 17150000 / 20658303 [83.01%]. Time per bit: 222.012 ms. 3656*22^4632494-1, bit: 17200000 / 20658303 [83.25%]. Time per bit: 12.610 ms. 3656*22^4632494-1, bit: 17250000 / 20658303 [83.50%]. Time per bit: 12.601 ms. 3656*22^4632494-1, bit: 17300000 / 20658303 [83.74%]. Time per bit: 12.709 ms. 3656*22^4632494-1, bit: 17350000 / 20658303 [83.98%]. Time per bit: 12.622 ms. </stderr_txt>

Profile JStateson
Send message
Joined: 30 Apr 19
Posts: 6
Credit: 182,390
RAC: 0
Message 5167 - Posted: 3 May 2019, 23:50:22 UTC - in response to Message 5166.

I looked at that data here and you are correct that the checkpointing was working. Be that as it may, I suspect it would have remained around %83 done for the next several days. When I restarted BOINC the estimated hours changed as I indicated. That could be a feature of boinc, not remembering starting over with an estimate.

I did complete several work units so not all was lost.

Maybe you can tell me what went wrong with app_config When I rebooted I did not get any more than 1 CPU on those long2.

This system was processing 3 gpugrid task but all other CPUs (about 18) were available. Only 5 long2 were running. Something missing or left out?

Profile PDW
Send message
Joined: 15 Oct 15
Posts: 41
Credit: 696,427,546
RAC: 474,985
Message 5168 - Posted: 4 May 2019, 6:09:40 UTC - in response to Message 5167.

See this thread for more info about checkpoints: http://srbase.my-firewall.org/sr5/forum_thread.php?id=1129

Can you post the contents of your app_config.xml file please.
[I know you linked to one you found but want to see what you actually used.]

[SG]Felix
Avatar
Send message
Joined: 25 Dec 17
Posts: 63
Credit: 12,867,590
RAC: 12,084
Message 5169 - Posted: 4 May 2019, 9:35:55 UTC - in response to Message 5168.


[I know you linked to one you found but want to see what you actually used.]



the linked one is mine, it should work.
maybe it isn't in the right directory

Profile Michael Goetz
Avatar
Send message
Joined: 1 Jan 15
Posts: 25
Credit: 1,197,466
RAC: 297
Message 5170 - Posted: 4 May 2019, 11:19:33 UTC

Bottom line is:

Ignore the percentage done.

Ignore the time estimates. Since the percent done is wrong, the estimates are useless.

Checkpointing works. Absolutely, Positively. You can trust it.
____________
Want to find one of the largest known primes? Try PrimeGrid. Or help cure disease at WCG.

Profile JStateson
Send message
Joined: 30 Apr 19
Posts: 6
Credit: 182,390
RAC: 0
Message 5172 - Posted: 4 May 2019, 11:30:46 UTC - in response to Message 5169.
Last modified: 4 May 2019, 11:35:23 UTC


[I know you linked to one you found but want to see what you actually used.]



the linked one is mine, it should work.
maybe it isn't in the right directory


seeing this in event log - is it a problem?

5/4/2019 6:27:38 AM | SRBase | Found app_config.xml 5/4/2019 6:27:38 AM | SRBase | unexpected text '(for WUprop optional)' in app_config.xml


how can I tell if more than 1 cpu is being used?

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 5426
Credit: 23,904,520
RAC: 11,313
Message 5173 - Posted: 4 May 2019, 11:32:59 UTC - in response to Message 5172.


[I know you linked to one you found but want to see what you actually used.]



the linked one is mine, it should work.
maybe it isn't in the right directory


seeing this in event log - is it a problem?

5/4/2019 6:27:38 AM | SRBase | Found app_config.xml 5/4/2019 6:27:38 AM | SRBase | unexpected text '(for WUprop optional)' in app_config.xml


remove this (for WUprop optional) from config file

Profile JStateson
Send message
Joined: 30 Apr 19
Posts: 6
Credit: 182,390
RAC: 0
Message 5174 - Posted: 4 May 2019, 11:40:42 UTC - in response to Message 5173.


[I know you linked to one you found but want to see what you actually used.]



the linked one is mine, it should work.
maybe it isn't in the right directory


seeing this in event log - is it a problem?

5/4/2019 6:27:38 AM | SRBase | Found app_config.xml 5/4/2019 6:27:38 AM | SRBase | unexpected text '(for WUprop optional)' in app_config.xml


remove this (for WUprop optional) from config file


Done, but things got worse
5/4/2019 6:37:49 AM | SRBase | Found app_config.xml 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase2'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase3'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase4'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase7'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase8'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase10'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase2'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase3'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase4'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase7'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase8'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase10'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9'

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 5426
Credit: 23,904,520
RAC: 11,313
Message 5176 - Posted: 4 May 2019, 11:46:58 UTC - in response to Message 5174.
Last modified: 4 May 2019, 11:47:42 UTC



Done, but things got worse
5/4/2019 6:37:49 AM | SRBase | Found app_config.xml 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase2'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase3'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase4'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase7'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase8'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase10'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase2'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase3'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase4'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase7'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase8'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9' 5/4/2019 6:37:49 AM | SRBase | Your app_config.xml file refers to an unknown application 'srbase10'. Known applications: 'srbase11', 'srbase12', 'srbase5', 'srbase6', 'srbase9'


Thats not a problem, you havent yet calculated some of these apps. If you have returned one result the errors are gone.

Profile JStateson
Send message
Joined: 30 Apr 19
Posts: 6
Credit: 182,390
RAC: 0
Message 5177 - Posted: 4 May 2019, 11:50:03 UTC - in response to Message 5176.
Last modified: 4 May 2019, 11:56:08 UTC

RUNNING WITH 4 CPUs might get a little toastie at 73c

had to restart boinc, not just re-read that app file

Profile PDW
Send message
Joined: 15 Oct 15
Posts: 41
Credit: 696,427,546
RAC: 474,985
Message 5178 - Posted: 4 May 2019, 11:55:28 UTC - in response to Message 5177.

RUNNING WITH 4 CPUs

had to restart boinc, not just re-read that app file

So if you had got your app_config.xml file right, what you did earlier when you re-started Boinc would have seen everything work as expected ?

Profile JStateson
Send message
Joined: 30 Apr 19
Posts: 6
Credit: 182,390
RAC: 0
Message 5179 - Posted: 4 May 2019, 11:59:48 UTC - in response to Message 5178.

RUNNING WITH 4 CPUs

had to restart boinc, not just re-read that app file

So if you had got your app_config.xml file right, what you did earlier when you re-started Boinc would have seen everything work as expected ?



I had the right app file, but it was wrong ;<)

Previous · 1 · 2 · 3
Post to thread

Message boards : Number crunching : Long 2 are a joke


Main page · Your account · Message boards


Copyright © 2014-2021 BOINC Confederation / rebirther