New TF multiGPU apps deployed (issues fixed)
log in

Advanced search

Message boards : News : New TF multiGPU apps deployed (issues fixed)

1 · 2 · 3 · 4 · Next
Author Message
Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9599 - Posted: 3 Feb 2024, 14:34:28 UTC
Last modified: 4 Feb 2024, 19:20:37 UTC

You need v28 to run it. There is only support for AMD and Nvidia cards.

To run more than 1 GPU you must have this cc_config:

<cc_config> <options> <use_all_gpus>1</use_all_gpus> </options> </cc_config>

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9600 - Posted: 3 Feb 2024, 14:37:10 UTC
Last modified: 3 Feb 2024, 17:35:55 UTC

There are still some errors in download. Need fix them soon.

Update:
ERR_RESULT_DOWNLOAD - The apps have different sizes but the same names, take a while to see some good results

Still looking for bad results, something is odd

Working apps so far:

cuda111 linux v23
openati linux v22
cuda120 linux v28

Profile Steve Dodd
Send message
Joined: 12 Sep 16
Posts: 7
Credit: 139,273,101
RAC: 44,819
Message 9603 - Posted: 3 Feb 2024, 17:18:48 UTC - in response to Message 9600.

All WU downloads are failing on Windows 10 that are still configured to run on only 1 GPU.
____________

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9604 - Posted: 3 Feb 2024, 17:21:04 UTC - in response to Message 9603.

All WU downloads are failing on Windows 10 that are still configured to run on only 1 GPU.


yes, Iam on it since 2h, perhaps I found a solution but who knows...

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9605 - Posted: 3 Feb 2024, 18:10:11 UTC
Last modified: 3 Feb 2024, 18:23:49 UTC

It should be fixed now, the wrapper had no permission status thats why all downloads failed. v28 also fixed the commandline in wrapper.

Profile Steve Dodd
Send message
Joined: 12 Sep 16
Posts: 7
Credit: 139,273,101
RAC: 44,819
Message 9606 - Posted: 3 Feb 2024, 20:13:17 UTC

Tried downloading new WU and have the following results:

2/3/2024 12:08:06 PM | SRBase | Started download of worktodo13a322_0231494.txt
2/3/2024 12:08:06 PM | SRBase | Started download of worktodo13a322_0238450.txt
2/3/2024 12:08:06 PM | SRBase | Started download of worktodo13a322_0238451.txt
2/3/2024 12:08:07 PM | SRBase | Finished download of worktodo13a322_0231494.txt (57 bytes)
2/3/2024 12:08:07 PM | SRBase | Finished download of worktodo13a322_0238450.txt (57 bytes)
2/3/2024 12:08:07 PM | SRBase | Finished download of worktodo13a322_0238451.txt (57 bytes)

but on the Tasks page of BOINC is shows all downloads failed. Shucks :(

TRINITAS
Send message
Joined: 19 Jan 24
Posts: 12
Credit: 12,570,000
RAC: 0
Message 9607 - Posted: 3 Feb 2024, 20:19:55 UTC

I receive the WUs, but it still calculates individually.

Profile Steve Dodd
Send message
Joined: 12 Sep 16
Posts: 7
Credit: 139,273,101
RAC: 44,819
Message 9608 - Posted: 3 Feb 2024, 20:40:09 UTC

And I just tried downloading again and now it works :) Must have been some leftovers.

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9609 - Posted: 3 Feb 2024, 21:33:30 UTC - in response to Message 9608.

And I just tried downloading again and now it works :) Must have been some leftovers.


yeah, I will deprecate 5 app versions soon.

TRINITAS
Send message
Joined: 19 Jan 24
Posts: 12
Credit: 12,570,000
RAC: 0
Message 9610 - Posted: 3 Feb 2024, 21:43:47 UTC

Do the GPUs have to be of identical architectures (Like an RTX 2080 with an RTX 2070) or can it be RTX 2080 with TITAN Xp. And the same for AMD?

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9611 - Posted: 3 Feb 2024, 21:49:24 UTC - in response to Message 9610.

Do the GPUs have to be of identical architectures (Like an RTX 2080 with an RTX 2070) or can it be RTX 2080 with TITAN Xp. And the same for AMD?


yes, but not a mix of AMD and Nvidia.

Mr P Hucker
Avatar
Send message
Joined: 30 Sep 17
Posts: 36
Credit: 16,105,684
RAC: 0
Message 9614 - Posted: 4 Feb 2024, 9:27:05 UTC - in response to Message 9611.
Last modified: 4 Feb 2024, 9:28:09 UTC

Not working in Windows 11 with two of AMD Radeon R9 280X cards (old things - 3GB, OpenCL 1.2). Both run and show progress in Boinc, but the second one shows 0% usage and generates no heat.

I assume the required version you speak of is the version of SRBase, which Boinc updates itself? I have 0.28 against the running tasks.

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9615 - Posted: 4 Feb 2024, 9:36:54 UTC - in response to Message 9614.

Not working in Windows 11 with two of AMD Radeon R9 280X cards (old things - 3GB, OpenCL 1.2). Both run and show progress in Boinc, but the second one shows 0% usage and generates no heat.

I assume the required version you speak of is the version of SRBase, which Boinc updates itself? I have 0.28 against the running tasks.


Can you post the stderr.txt of the 2nd card?

Mr P Hucker
Avatar
Send message
Joined: 30 Sep 17
Posts: 36
Credit: 16,105,684
RAC: 0
Message 9616 - Posted: 4 Feb 2024, 9:52:06 UTC - in response to Message 9615.

Not working in Windows 11 with two of AMD Radeon R9 280X cards (old things - 3GB, OpenCL 1.2). Both run and show progress in Boinc, but the second one shows 0% usage and generates no heat.

I assume the required version you speak of is the version of SRBase, which Boinc updates itself? I have 0.28 against the running tasks.

Can you post the stderr.txt of the 2nd card?

I don't have that file, or know where to find it.
I have stderrdae.txt, stderrgpudetect.txt, stdoutdae.txt, stdoutgpudetect.txt in c:\programdata\boinc.

Mr P Hucker
Avatar
Send message
Joined: 30 Sep 17
Posts: 36
Credit: 16,105,684
RAC: 0
Message 9617 - Posted: 4 Feb 2024, 9:53:29 UTC - in response to Message 9616.

Ah I see, I think I need to let it run to completion (if it will). Trying, will get back to you....

Mr P Hucker
Avatar
Send message
Joined: 30 Sep 17
Posts: 36
Credit: 16,105,684
RAC: 0
Message 9618 - Posted: 4 Feb 2024, 9:57:01 UTC - in response to Message 9617.

In the meantime you can view one which started on the second card and I aborted after a few minutes:

https://srbase.my-firewall.org/sr5/result.php?resultid=141051766

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9619 - Posted: 4 Feb 2024, 9:57:23 UTC - in response to Message 9616.

Not working in Windows 11 with two of AMD Radeon R9 280X cards (old things - 3GB, OpenCL 1.2). Both run and show progress in Boinc, but the second one shows 0% usage and generates no heat.

I assume the required version you speak of is the version of SRBase, which Boinc updates itself? I have 0.28 against the running tasks.

Can you post the stderr.txt of the 2nd card?

I don't have that file, or know where to find it.
I have stderrdae.txt, stderrgpudetect.txt, stdoutdae.txt, stdoutgpudetect.txt in c:\programdata\boinc.


Check your data/slots folder in BOINC, click on a WU-->properties in BOINCmanager where you can find the right slot number.

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9620 - Posted: 4 Feb 2024, 10:00:43 UTC - in response to Message 9618.
Last modified: 4 Feb 2024, 10:02:41 UTC

In the meantime you can view one which started on the second card and I aborted after a few minutes:

https://srbase.my-firewall.org/sr5/result.php?resultid=141051766


Looks good so far. Perhaps the progress bar in BOINC is not working for the 2nd card. You can try to check GPUZ if the card is doing something.

Mr P Hucker
Avatar
Send message
Joined: 30 Sep 17
Posts: 36
Credit: 16,105,684
RAC: 0
Message 9621 - Posted: 4 Feb 2024, 10:02:57 UTC - in response to Message 9620.

Well there's no heat coming from the card and MSI afterburner shows 0% usage. Boinc and the text file claim it's progressing, I assume the end result will be invalid. The one still running will be done in 7 minutes.

Profile rebirther
Volunteer moderator
Project administrator
Project developer
Project tester
Project scientist
Avatar
Send message
Joined: 2 Jan 13
Posts: 7457
Credit: 42,792,827
RAC: 4,918
Message 9623 - Posted: 4 Feb 2024, 10:29:55 UTC - in response to Message 9621.

Well there's no heat coming from the card and MSI afterburner shows 0% usage. Boinc and the text file claim it's progressing, I assume the end result will be invalid. The one still running will be done in 7 minutes.


I cant find any useful logs. The GPU card was recognized, the selftest was successful und kernel compilation completed and run.

We can try a standalone test if the 2nd card is running. I dont trust BOINC 7.24.1 so we can do the test without BOINC.

1 · 2 · 3 · 4 · Next
Post to thread

Message boards : News : New TF multiGPU apps deployed (issues fixed)


Main page · Your account · Message boards


Copyright © 2014-2024 BOINC Confederation / rebirther