Posts by DeleteNull
log in
1) Message boards : Cafe : Rebirther is currently in hospital. (Message 9913)
Posted 9 days ago by DeleteNull
Hello @all:

Rebirther is currently in hospital. He can write, e.g. in Discord, but is unable to manage the server.

So the upcoming challenge isn't possible.

Recovery hopefully will be possible, but no time schedule yet.

Regards
DeleteNull
2) Message boards : Number crunching : Nvidia Tesla P100 Problems (Message 9761)
Posted 17 Mar 2024 by DeleteNull
That fixed one of my PCs.

Another still errors out on every task
https://srbase.my-firewall.org/sr5/result.php?resultid=145885980


Can you update your driver on this host?
3) Message boards : News : New TF multiGPU apps deployed (issues fixed) (Message 9667)
Posted 7 Feb 2024 by DeleteNull
Are you sure Boinc is in the right order? I see it give a random order compared to say MSI Afterburner. I had a machine with 6 cards in it, and Boinc's order compared to MSI was something like 3, 5, 1, 2, 0, 4. I assumed it was physical connection order vs. driver loading order. Although Folding@Home is different again. Very annoying when trying to allocate cards or work out which is overheating.


Yes in my case with "only" two GPU BOINC has the right order. The GPU where the boot screen appears is GPU0 and the other (dark) one is GPU1. The same order as the BIOS allocates the screens.
4) Message boards : News : New TF multiGPU apps deployed (issues fixed) (Message 9665)
Posted 7 Feb 2024 by DeleteNull
Just made a test for two AMD GPU in a windows host. 5500XT and 6500XT.

The wrapper works well, device 0 has this message:
2024-02-07 13:34:18 (8960): wrapper (7.24.26018): starting
2024-02-07 13:34:18 (8960): wrapper: running mfakto.exe (-d 0)
2024-02-07 13:34:18 (8960): wrapper: created child process 2012
mfakto 0.15pre7-MGW (64bit build)

device 1 has this message:
2024-02-07 13:34:19 (6484): wrapper (7.24.26018): starting
2024-02-07 13:34:19 (6484): wrapper: running mfakto.exe (-d 1)
2024-02-07 13:34:19 (6484): wrapper: created child process 5128
mfakto 0.15pre7-MGW (64bit build)

What does mfakto do?
device 0:
OpenCL device info
name gfx1034 (Advanced Micro Devices, Inc.)
device (driver) version OpenCL 2.0 AMD-APP (3592.0) (3592.0 (PAL,LC))
gfx1034 => 6500XT

device1:
OpenCL device info
name gfx1012:xnack- (Advanced Micro Devices, Inc.)
device (driver) version OpenCL 2.0 AMD-APP (3592.0) (3592.0 (PAL,LC))
gfx1012 => 5500XT

This is a bit confusing because BOINC tells this:
07.02.2024 13:32:57 | | OpenCL: AMD/ATI GPU 0: Radeon RX 5500 XT (driver version 3592.0 (PAL,LC), device version OpenCL 2.0 AMD-APP (3592.0), 8176MB, 8176MB available, 4891 GFLOPS peak)
07.02.2024 13:32:57 | | OpenCL: AMD/ATI GPU 1: AMD Radeon RX6500 XT (driver version 3592.0 (PAL,LC), device version OpenCL 2.0 AMD-APP (3592.0), 4080MB, 4080MB available, 5499 GFLOPS peak)

So this is the opposite assignement to that was mfakto does.

The calculation in fact is done only with one GPU: the 5500XT (e.g. GPU-Z)

So there seems a bug in mfakto, not in the wrapper.

It is not a driver issue, PrimeGrid uses both GPU in parallel.
5) Message boards : Number crunching : Question/Problem with Intel GPU work units (Message 9662)
Posted 6 Feb 2024 by DeleteNull
The main problem is this (mfactc.c):
GPU_type gpu_types[]={
{GPU_AUTO, 0, "AUTO"},
{GPU_VLIW4, 64, "VLIW4"},
{GPU_VLIW5, 80, "VLIW5"},
{GPU_GCN, 64, "GCN"},
{GPU_GCN2, 64, "GCN2"},
{GPU_GCN3, 64, "GCN3"},
{GPU_GCN4, 64, "GCN4"},
{GPU_GCN5, 64, "GCN5"},
{GPU_GCNF, 64, "GCNF"},
{GPU_RDNA, 64, "RDNA"},
{GPU_APU, 80, "APU"},
{GPU_CPU, 1, "CPU"},
{GPU_NVIDIA, 8, "NVIDIA"},
{GPU_INTEL, 1, "INTEL"},
{GPU_UNKNOWN, 0, "UNKNOWN"}
};

Depending on different values for the integer in {GPU_INTEL, 1, "INTEL"}, you will get different results for the number of successful and "not successfull" self tests. But will never get it to 0 of not successful?
6) Message boards : News : A bad day happened (Message 9159)
Posted 15 Oct 2023 by DeleteNull
Just read this in Discord.
Get well soon, and (hopefully) recover all what is broken/damaged!
7) Message boards : Number crunching : Interest in adding a new subproject? (Message 9042)
Posted 15 Sep 2023 by DeleteNull
Just tested. It is slower (NVidia 4080) than a CPU.
8) Message boards : Number crunching : 2 GPUs, 2 tasks on one card, not utilizing gpu 1, (Message 8638)
Posted 7 Jan 2023 by DeleteNull
As far as I know: BOINC passes e.g. "-device 1" to the device so may be we have to update the code that it understands -device instead of -d.
9) Message boards : Number crunching : 2 GPUs, 2 tasks on one card, not utilizing gpu 1, (Message 8636)
Posted 7 Jan 2023 by DeleteNull
Hello, this is the content of a job file:
../../projects/srbase.my-firewall.org_sr5/job_TF_l64c_00020.xml

<job_desc>
<task>
<application>./mfaktc.exe</application>
<append_cmdline_args/>
</task>
<unzip_input>
<zipfilename>mfaktc-linux64-v6.zip</zipfilename>
</unzip_input>
</job_desc>

if you want a device number you have to add a -d <number> parameter (default is 0)

Usage: ./mfaktc.exe [options]
-h display this help and exit
-d <device number> specify the device number used by this program
-tf <exp> <min> <max> trial factor M<exp> from 2^<min> to 2^<max> and exit
instead of parsing the worktodo file
-st run builtin selftest and exit
-st2 same as -st but extended range for k_min/m_max
-v <number> set verbosity (min = 0, default = 1, more = 2, max/debug = 3)
10) Message boards : Number crunching : Trial Factoring (Message 8594)
Posted 30 Dec 2022 by DeleteNull
Hello,

you have to copy the needed libs into your library folder (for me /lib64).
then create a link:
e.g.:
lrwxrwxrwx 1 root root 21 29. Dez 23:46 libcudart.so.12 -> libcudart.so.12.0.107
lrwxrwxrwx 1 root root 21 29. Dez 23:52 libcudart.so.12.0 -> libcudart.so.12.0.107
-rwxr-xr-x 1 root root 675168 29. Dez 23:45 libcudart.so.12.0.107

after this is done mfactc.exe can execute the self test:

Selftest statistics
number of tests 26192
successfull tests 26192

kernel | success | fail
-------------------+---------+-------
UNKNOWN kernel | 0 | 0
71bit_mul24 | 2586 | 0
75bit_mul32 | 2682 | 0
95bit_mul32 | 2867 | 0
barrett76_mul32 | 1096 | 0
barrett77_mul32 | 1114 | 0
barrett79_mul32 | 1153 | 0
barrett87_mul32 | 1066 | 0
barrett88_mul32 | 1069 | 0
barrett92_mul32 | 1084 | 0
75bit_mul32_gs | 2420 | 0
95bit_mul32_gs | 2597 | 0
barrett76_mul32_gs | 1079 | 0
barrett77_mul32_gs | 1096 | 0
barrett79_mul32_gs | 1130 | 0
barrett87_mul32_gs | 1044 | 0
barrett88_mul32_gs | 1047 | 0
barrett92_mul32_gs | 1062 | 0

selftest PASSED!
11) Message boards : Number crunching : Linux GPU errors (Message 7014)
Posted 23 Nov 2020 by DeleteNull

Have you tried running multiple tasks at the same time?


No, I haven't. Performance is outstanding, so there is no need to do so (for me).
12) Message boards : Number crunching : Linux GPU errors (Message 7002)
Posted 22 Nov 2020 by DeleteNull
Depending on the flavour of Linux you are using you have to choose the right directory.
For Ubuntu (and derivatives) it is /usr/lib and in OpenSuse it is /usr/lib64.
But there are some other Linux Distros to confuse the user ;)
13) Message boards : Number crunching : mfaktc.exe --perftest for RTX3080/90 cards (Message 6933)
Posted 12 Nov 2020 by DeleteNull
o.k., thanks. With this number the result is:
no factor for M219111587 from 2^70 to 2^71 [mfaktc 0.21 barrett76_mul32_gs]
tf(): total time spent: 19.759s
14) Message boards : Number crunching : mfaktc.exe --perftest for RTX3080/90 cards (Message 6931)
Posted 12 Nov 2020 by DeleteNull
~/mfakt/compile/mfaktc-master> ./mfaktc.exe --perftest
mfaktc v0.21 (64bit built)

Compiletime options
THREADS_PER_BLOCK 256
SIEVE_SIZE_LIMIT 32kiB
SIEVE_SIZE 193154bits
SIEVE_SPLIT 250
MORE_CLASSES enabled

Runtime options
SievePrimes 25000
SievePrimesAdjust 1
SievePrimesMin 5000
SievePrimesMax 100000
NumStreams 3
CPUStreams 3
GridSize 3
GPU Sieving enabled
GPUSievePrimes 82486
GPUSieveSize 64Mi bits
GPUSieveProcessSize 16Ki bits
Checkpoints enabled
CheckpointDelay 30s
WorkFileAddDelay 600s
Stages enabled
StopAfterFactor bitlevel
PrintMode full
V5UserID (none)
ComputerID (none)
AllowSleep no
TimeStampInResults no

CUDA version info
binary compiled for CUDA 11.10
CUDA runtime version 11.10
CUDA driver version 11.10

CUDA device info
name GeForce RTX 3080
compute capability 8.6
max threads per block 1024
max shared memory per MP 102400 byte
number of multiprocessors 68
clock rate (CUDA cores) 1815MHz
memory clock rate: 9501MHz
memory bus width: 320 bit

Automatic parameters
threads per grid 557056
GPUSievePrimes (adjusted) 82486
GPUsieve minimum exponent 1055144

running a simple selftest...
Selftest statistics
number of tests 107
successfull tests 107

selftest PASSED!

ERROR: get_next_assignment(): no valid assignment found in "worktodo.txt"

which value shall worktodo.txt have?
15) Message boards : Number crunching : Trial Factoring tests (Message 6235)
Posted 15 Apr 2020 by DeleteNull
I've patched mfakto, not mfaktc.
So the mapping --device 0 to -d 00 is only implemented for AMD, not NVidia.
(and only linux)
16) Message boards : Number crunching : Trial Factoring tests (Message 6233)
Posted 15 Apr 2020 by DeleteNull
o.k., you can find the updated wrapper.cpp here:
https://p-numbers.net/wrapper.cpp

Works for mfakto (mfaktc), but will not work for LLR.
17) Message boards : Number crunching : Trial Factoring tests (Message 6231)
Posted 15 Apr 2020 by DeleteNull
Do you have the complete wrapper.cpp file?


The new wrapper works as expected.
Task starts with 0%, switches to 100% after one minute.
This is because there is no Mxxxxxx.ckp file in the first 5 minutes.

After 5 minutes it switches to the percentage of the last Mxxxxxx.ckp.
The remaining time increases (because the percentage stays at the old level for 5 minutes)
After 10 minutes is the next update, and then every 5 minutes.

Is this o.k., or shall I implement a smoother calculation of the percentage (a few more steps in the 5 minute interval)?
18) Message boards : Number crunching : Trial Factoring tests (Message 6225)
Posted 15 Apr 2020 by DeleteNull
Theoretical both applicatins can support more than one (different) GPU.
But: BOINC enumerates the GPU with 0, 1, 2, ....
In OpenCl you have platforms, e.g. Intel=0, AMD=1, NVidia=2, and for each platform 1..n devices GPU.

A mapping form 0, 1, 2 to 00, 10, 11 is different for each computer with more than one graphics device.

So there is currently only one mapping --device 0 to d 00. (possible)

Sorry to interject your development process.

Why are you using "--device x" for mfakto and mfaktc ?
Both their guides say to use "-d x"

Q: Does mfakto support multiple GPUs?
A: No, but you can use the -d option to tell an instance to run on a specific
device. Please also read the next question.

Q Does mfaktc support multiple GPUs?
A Yes, with the exception that a single instance of mfaktc can only use one
GPU. For each GPU you want to run mfaktc on you need (at least) one
instance of mfaktc. For each instance of mfaktc you can use the
commandline option "-d <GPU number>" to specify which GPU to use for each
specific mfaktc instance. Please read the next question, too.

I can run a second task on a second GPU but only if I specify "-d x", as soon as I pass "--device x" (or "-device x") on the command line it defaults to the first GPU.

Can you use "-d x" for linux wrapper at least please ?
19) Message boards : Number crunching : Trial Factoring tests (Message 6224)
Posted 15 Apr 2020 by DeleteNull
No issues.
In about an hour an "old" WU is finished, the new WU should use the new wrapper and I can report.

I am not a C/C++ developer. The new method just reads the worktodo.txt with the number and contructs the file name for the checkpoint file of mfaktc/mfakto and calculates the fraction of "work done". This can be used by the method fraction_done() of the (original) wrapper.

If it succeeds I will send you the wrapper.cpp so it can be used for Windows too.
20) Message boards : Number crunching : Trial Factoring tests (Message 6220)
Posted 15 Apr 2020 by DeleteNull
What ending is exe?
The file is named wrapper_26016_7.16_mfakt
It is linux executable (-rwxr-xr-x)

The new method is:
double getMfraction() { FILE* f = fopen("worktodo.txt", "r"); if (!f) return 0; char* p; char buf[256]; p = fgets(buf, 256, f); fclose(f); if (p == NULL) return 0; char * qch; qch = strtok(p, "="); qch = strtok(NULL, ","); int len = strlen(qch); char mfile[6+len]; mfile[0] = 'M'; int i; for (i = 0; i < len; i++) { mfile[1+i] = qch[i]; } mfile[1+len] = '.'; mfile[2+len] = 'c'; mfile[3+len] = 'k'; mfile[4+len] = 'p'; mfile[5+len] = '\0'; FILE* ff = fopen(mfile, "r"); if (!ff) return 0; char* pp; pp = fgets(buf, 256, ff); fclose(ff); if (pp == NULL) return 0; char * pch; int count = 0; double all = 0, done = 0, frac = 0; pch = strtok (pp, " "); while (pch != NULL) { count++; if (count == 4) all = atof(pch); if (count == 7) done = atof(pch); pch = strtok (NULL, " "); } if (all > 0) frac = done / all; if (frac < 0) return 0; if (frac > 1) return 1; return frac; }


Next 20

Main page · Your account · Message boards


Copyright © 2014-2024 BOINC Confederation / rebirther