Posts by Skip Da Shu
log in
1) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9862)
Posted 25 Mar 2024 by Skip Da Shu
1st one completed and returned from RX6800.

https://srbase.my-firewall.org/sr5/result.php?resultid=147398812


Another is running on the RX6800XT now.

So was it a null assigned to $GPU_DEVICE_NUM in job_TF_l64m1_00039.xml?



2 done and reported from RX6800XT. They are sub 4 minutes clock time on this card.

https://srbase.my-firewall.org/sr5/result.php?resultid=147174011

Thank you so much for walking thru the testing with us.

tschuess, Skip
2) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9860)
Posted 25 Mar 2024 by Skip Da Shu
1st one completed and returned from RX6800.

https://srbase.my-firewall.org/sr5/result.php?resultid=147398812


Another is running on the RX6800XT now.

So was it a null assigned to $GPU_DEVICE_NUM in job_TF_l64m1_00039.xml?
3) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9853)
Posted 23 Mar 2024 by Skip Da Shu
klingt gut, Chef!
4) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9851)
Posted 23 Mar 2024 by Skip Da Shu
Yes, counted from 0... back when I had a 5700G APU/CPU the discrete card was 0 & the iGPU was 1. I excluded the iGPU in cc_config.xml.

I'm still curious how $GPU_DEVICE_NUM is set.
5) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9848)
Posted 23 Mar 2024 by Skip Da Shu
Trying to show what that variable is set to (that s/b = 0) by going into the boinc folder where srbase runs from .... as root.. gives me nothing back so I assume it's set to null.

root@sharon-ES7C95:/var/lib/boinc-client/projects/srbase.my-firewall.org_sr5# echo $GPU_DEVICE_NUM
6) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9846)
Posted 23 Mar 2024 by Skip Da Shu
/home/sharon/mfacto/mfakto-wrappertest/

has these files in it:
barrett15.cl barrett.cl boinc_finish_called common.cl COPYING datatypes.h gpusieve.cl job.xml mfakto.ini mfakto_Kernels.cl mfakto_Kernels.elf mfakto-x64 montgomery.cl mul24.cl README.txt results.txt stderr.txt tf_debug.h todo.txt worktodo.txt wrapper_26018_linux-u18_x86-64 wrapper_checkpoint.txt wrapperdir
7) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9844)
Posted 23 Mar 2024 by Skip Da Shu
What files? R we talking wrapper-test or BOINC world?
8) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9842)
Posted 23 Mar 2024 by Skip Da Shu
In terminal I ran:

sharon@sharon-ES7C95:~/mfakto/mfakto-wrappertest$ sudo ./wrapper_26018_linux-u18_x86-64



mfakto/mfakto-wrappertest folder has this in job.xml:

<job_desc> <task> <application>./mfakto-x64</application> <command_line>-d 0</command_line> </task> <unzip_input> <zipfilename>mfakto-linux64-v10.zip</zipfilename> </unzip_input> </job_desc>


sharon@sharon-ES7C95:~/mfakto/mfakto-wrappertest$ echo $GPU_DEVICE_NUM
Doesn't display anything visible... I was hoping to see a #

Do you know where this gets set?
9) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9840)
Posted 23 Mar 2024 by Skip Da Shu
<command_line>-d 0</command_line>

I'll assume this in the BOINC world comes from the file "job_TF_l64m1_00029.xml" which has

<job_desc> <task> <application>./mfakto-x64</application> <command_line>-d $GPU_DEVICE_NUM</command_line> </task> <unzip_input> <zipfilename>mfakto-linux64-v10.zip</zipfilename> </unzip_input> </job_desc>
10) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9838)
Posted 23 Mar 2024 by Skip Da Shu
Yes.

It has one line in it:

no factor for M758604829 from 2^73 to 2^74 [mfakto 0.15pre8 cl_barrett15_74_gs_2]

Releasing a suspended TF task in BOINC still gets this:

https://srbase.my-firewall.org/sr5/result.php?resultid=146304037
11) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9836)
Posted 23 Mar 2024 by Skip Da Shu
I re-ran the 'wrapper' test as sudo and it finds the GPU! After a couple minutes it gets some sort of error with worktodo.txt but it was running on the GPU as I could see the load go up in corectrl.

I forgot to kill BOINC before running this so it ran while 2 E@H tasks were also processing on the RX6800.

Let me know what you want me to do next boss.

sharon@sharon-ES7C95:~/mfakto/mfakto-wrappertest$ sudo ./wrapper_26018_linux-u18_x86-64 [sudo] password for sharon: 2024-03-23 10:16:32 (8297): wrapper (7.24.26018): starting mfakto 0.15pre8 (64-bit build) Runtime options INI file mfakto.ini Verbosity 1 SieveOnGPU yes MoreClasses yes GPUSievePrimes 82486 GPUSieveProcessSize 24 Kib GPUSieveSize 96 Mib FlushInterval 8 WorkFile worktodo.txt ResultsFile results.txt Checkpoints enabled CheckpointDelay 60 s Stages enabled StopAfterFactor class PrintMode compact V5UserID none ComputerID none TimeStampInResults no VectorSize 2 GPUType AUTO SmallExp no UseBinfile mfakto_Kernels.elf Compile-time options Select device - Get device info: OpenCL device info name gfx1030 (Advanced Micro Devices, Inc.) device (driver) version OpenCL 2.0 (3513.0 (HSA1.1,LC)) maximum threads per block 1024 maximum threads per grid 1073741824 number of multiprocessors 30 (1920 compute elements) clock rate 2475 MHz Automatic parameters threads per grid 0 optimizing kernels for RDNA Compiling kernels. GPUSievePrimes (adjusted) 82486 GPUsieve minimum exponent 1055144 Started a simple self-test ... Self-test statistics number of tests 30 successful tests 30 self-test PASSED! got assignment: exp=758604829 bit_min=73 bit_max=74 (2.52 GHz-days) Starting trial factoring M758604829 from 2^73 to 2^74 (2.52 GHz-days) Using GPU kernel "cl_barrett15_74_gs_2" Date Time | class Pct | time ETA | GHz-d/day Sieve Wait Mar 23 10:19 | 4619 100.0% | 0.162 0m00s | 1400.98 82486 0.00% no factor for M758604829 from 2^73 to 2^74 [mfakto 0.15pre8 cl_barrett15_74_gs_2] tf(): total time spent: 2m 33.955s (1415.22 GHz-days / day) ERROR: get_next_assignment(): no valid assignment found in "worktodo.txt"
12) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9835)
Posted 23 Mar 2024 by Skip Da Shu
I went back and re-ran the original stand-alone test we did.

root@sharon-ES7C95:/var/lib/boinc-client/projects/srbase.my-firewall.org_sr5/mfakto-test/mfakto-linux64-v10# ./mfakto-x64 -st > test.log


As you can see this one was run in a test folder (mfakto-linux64-v10) under the srbase projects directory of BOINC (BOINC was running) and again it finds the GPU and completes. This is the bottom of test.log:

######### test case 34071/34071 (M112404491[91-92]) ######### Starting trial factoring M112404491 from 2^91 to 2^92 (4461450.54 GHz-days) Date Time | class Pct | time ETA | GHz-d/day Sieve Wait Mar 23 09:40 | 1848 0.1% | 0.009 n.a. | n.a. 82486 0.00% M112404491 has a factor: 3941616367695054034124905537 (91.670846 bits, 2992945.937358 GHz-d) found 1 factor for M112404491 from 2^91 to 2^92 [mfakto 0.15pre8 cl_barrett32_92_gs_2] self-test for M112404491 passed (cl_barrett32_92_gs)! tf(): total time spent: 0.009s Self-test statistics number of tests 34026 successful tests 34026 self-test PASSED!


The 'wrapper' test and all since where it doesn't find the GPU was run in test folders under my home directory.

Here's a list of what's in srbase.my-firewall.org_sr5/mfakto-test/mfakto-linux64-v10:

barrett15.cl
barrett.cl
Changelog-mfakto.txt
common.cl
COPYING
datatypes.h
gpusieve.cl
mfakto.ini
mfakto_Kernels.cl
mfakto_Kernels.elf
mfakto-x64
montgomery.cl
mul24.cl
project_dir
README-SpecialVersions.txt
README.txt
test.log
test.log.old
tf_debug.h
todo.txt

I don't know what's inherited from /var/lib/boinc-client or projects/srbase.my-firewall.org_sr5 that allows the GPU to be found when running from here.

I will re-try the wrapper test using sudo just to make sure it's not the root privileges that's allowing it to be found.


Skip
13) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9833)
Posted 23 Mar 2024 by Skip Da Shu
IF I understand this last test was back to stand-alone w/o the wrapper and it now did not find the GPU. Whereas previously the GPU wasn't found only when we ran inside the wrapper. Is this correct?
14) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9831)
Posted 23 Mar 2024 by Skip Da Shu


Nope, try to download mfakto-linux64-v9.zip It contains the older mfakto file, replace it with yours. Also overwrite all .cl files.


1) I unzipped this downloaded zip creating a new test folder (mfakto-linux64-v9).
2) Replaced the mfakto-64 in this new folder with mfakto-64 from my 'wrapper=test' folder.
3) Replaced the .cl files in this new folder with wrapper-test's 7 .cl files.
5) Executed this --
ES7C95:~/mfakto/mfakto-linux64-v9$ ./mfakto-x64

6) Got the following but feel like I regressed...

mfakto 0.15pre8 (64-bit build)


Runtime options INI file mfakto.ini Verbosity 1 SieveOnGPU yes MoreClasses yes GPUSievePrimes 82486 GPUSieveProcessSize 24 Kib GPUSieveSize 96 Mib FlushInterval 8 WorkFile worktodo.txt ResultsFile results.txt Checkpoints enabled CheckpointDelay 60 s Stages enabled StopAfterFactor class PrintMode compact V5UserID none ComputerID none TimeStampInResults no VectorSize 2 GPUType AUTO SmallExp no UseBinfile mfakto_Kernels.elf Compile-time options Select device - GPU not found, fallback to CPU. Error -1 (Device not found): clCreateContextFromType(CPU) ERROR: init_CL(3, 0) failed
15) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9828)
Posted 22 Mar 2024 by Skip Da Shu
Is this of any help?
*-pci description: PCI bridge product: Navi 10 XL Downstream Port of PCI Express Switch vendor: Advanced Micro Devices, Inc. [AMD/ATI] physical id: 0 bus info: pci@0000:2c:00.0 logical name: /dev/fb0 version: 00 width: 64 bits clock: 33MHz capabilities: pci pm pciexpress msi normal_decode bus_master cap_list fb configuration: depth=32 driver=pcieport mode=1920x1080 visual=truecolor xres=1920 yres=1080 resources: iomemory:e1e10-e1e0f irq:39 ioport:e000(size=4096) memory:fc900000-fcafffff ioport:7800000000(size=17448304640) *-display description: VGA compatible controller product: Navi 21 [Radeon RX 6800/6800 XT / 6900 XT] vendor: Advanced Micro Devices, Inc. [AMD/ATI] physical id: 0 bus info: pci@0000:2d:00.0 logical name: /dev/fb0 version: c3 width: 64 bits clock: 33MHz capabilities: pm pciexpress msi vga_controller bus_master cap_list rom fb configuration: depth=32 driver=amdgpu latency=0 resolution=1920,1080
16) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9827)
Posted 22 Mar 2024 by Skip Da Shu
This box has only the single RX6800 in the first (top) x16 slot.

-d 0
-d 1
-d 2

all produced same error:

2024-03-22 16:13:19 (8700): Can't open init data file - running in standalone mode 2024-03-22 16:13:19 (8700): wrapper (7.24.26018): starting 2024-03-22 16:13:19 (8700): wrapper: running ./mfakto-x64 (-d 2) 2024-03-22 16:13:19 (8700): wrapper: created child process 8703 Error -1 (Device not found): clCreateContextFromType(CPU) 2024-03-22 16:13:20 (8700): ./mfakto-x64 exited; CPU time 0.042067 2024-03-22 16:13:20 (8700): app exit status: 0x2 2024-03-22 16:13:20 (8700): called boinc_finish(195)


There must be another format for specifying the GPU device??

Skip


try to remove the -d entry in job.xml. Its odd why it cant find your GPU.


"wrapper: running ./mfakto-x64 (-d 2)" becomes "wrapper: running ./mfakto-x64 ()" and same error.
17) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9817)
Posted 22 Mar 2024 by Skip Da Shu
This box has only the single RX6800 in the first (top) x16 slot. BOINC calls it device 0.

-d 0
-d 1
-d 2

all produced same error:

2024-03-22 16:13:19 (8700): Can't open init data file - running in standalone mode 2024-03-22 16:13:19 (8700): wrapper (7.24.26018): starting 2024-03-22 16:13:19 (8700): wrapper: running ./mfakto-x64 (-d 2) 2024-03-22 16:13:19 (8700): wrapper: created child process 8703 Error -1 (Device not found): clCreateContextFromType(CPU) 2024-03-22 16:13:20 (8700): ./mfakto-x64 exited; CPU time 0.042067 2024-03-22 16:13:20 (8700): app exit status: 0x2 2024-03-22 16:13:20 (8700): called boinc_finish(195)


There must be another format for specifying the GPU device??

Skip
18) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9813)
Posted 22 Mar 2024 by Skip Da Shu
stderr:

2024-03-22 16:01:34 (8214): Can't open init data file - running in standalone mode 2024-03-22 16:01:34 (8214): wrapper (7.24.26018): starting 2024-03-22 16:01:34 (8214): wrapper: running ./mfakto-x64 (-d 0) 2024-03-22 16:01:34 (8214): wrapper: created child process 8216 Error -1 (Device not found): clCreateContextFromType(CPU) 2024-03-22 16:01:35 (8214): ./mfakto-x64 exited; CPU time 0.049071 2024-03-22 16:01:35 (8214): app exit status: 0x2 2024-03-22 16:01:35 (8214): called boinc_finish(195)



terminal:

ES7C95:~/mfakto/mfakto-wrappertest$ ./wrapper_26018_linux-u18_x86-64 2024-03-22 16:01:34 (8214): wrapper (7.24.26018): starting mfakto 0.15pre8 (64-bit build) Runtime options INI file mfakto.ini Verbosity 1 SieveOnGPU yes MoreClasses yes GPUSievePrimes 82486 GPUSieveProcessSize 24 Kib GPUSieveSize 96 Mib FlushInterval 8 WorkFile worktodo.txt ResultsFile results.txt Checkpoints enabled CheckpointDelay 60 s Stages enabled StopAfterFactor class PrintMode compact V5UserID none ComputerID none TimeStampInResults no VectorSize 2 GPUType AUTO SmallExp no UseBinfile mfakto_Kernels.elf Compile-time options Select device - GPU not found, fallback to CPU. ERROR: init_CL(3, 0) failed


Seems like it doesn't like either one,
19) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9811)
Posted 22 Mar 2024 by Skip Da Shu
stderr.txt:

2024-03-22 15:49:09 (7750): Can't open init data file - running in standalone mode 2024-03-22 15:49:09 (7750): wrapper (7.24.26018): starting 2024-03-22 15:49:09 (7750): wrapper: running ./mfakto-x64 (-device 0) 2024-03-22 15:49:09 (7750): wrapper: created child process 7754 ERROR: unknown option '-device' 2024-03-22 15:49:10 (7750): ./mfakto-x64 exited; CPU time 0.000000 2024-03-22 15:49:10 (7750): app exit status: 0x1 2024-03-22 15:49:10 (7750): called boinc_finish(195)
20) Message boards : Number crunching : Linux TF WUs on ATI 6800 cards (Message 9805)
Posted 22 Mar 2024 by Skip Da Shu
Yes, but end results still the same:

https://srbase.my-firewall.org/sr5/result.php?resultid=146304058

Let me know what I need to do next.

Thanx, Skip


Next 20

Main page · Your account · Message boards


Copyright © 2014-2024 BOINC Confederation / rebirther