Thread 'Geppetto Test 001 - Segnalazioni e Commenti'

Message boards : Number crunching : Geppetto Test 001 - Segnalazioni e Commenti
Message board moderation

To post messages, you must log in.

1 · 2 · Next

AuthorMessage
Matteo Rinaldi
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Help desk expert

Send message
Joined: 22 Jan 25
Posts: 2
Credit: 4
RAC: 0
Message 1 - Posted: 5 Feb 2025, 22:24:27 UTC

Thread ufficiale per discutere del primo test dell'applicazione "Geppetto Boinc"
ID: 1 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
dave

Send message
Joined: 7 Feb 25
Posts: 4
Credit: 807
RAC: 80
Message 4 - Posted: 7 Feb 2025, 12:53:31 UTC - in response to Message 1.  
Last modified: 7 Feb 2025, 13:03:32 UTC

Downloaded docker and then ran
sudo ln -s /usr/bin/docker /bin/unknown


All tasks result in error.

I have installed buildx and will try again.
Still getting errors.
running docker command: ps --all --filter "name=boinc__boinc.llmentor.org_llmentorgrid__batch_18__job_wu525_0"
permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Get "http://%2Fvar%2Frun%2Fdocker.sock/v1.45/containers/json?all=1&filters=%7B%22name%22%3A%7B%22boinc__boinc.llmentor.org_llmentorgrid__batch_18__job_wu525_0%22%3Atrue%7D%7D": dial unix /var/run/docker.sock: connect: permission denied
command output:
creating container boinc__boinc.llmentor.org_llmentorgrid__batch_18__job_wu525_0
running docker command: images
ID: 4 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
entity

Send message
Joined: 6 Feb 25
Posts: 8
Credit: 12,258
RAC: 876
Message 5 - Posted: 7 Feb 2025, 14:54:03 UTC - in response to Message 4.  
Last modified: 7 Feb 2025, 14:55:51 UTC

Installed on Fedora 41 system and all worked successfully. Work unit ran for 4 minutes wall clock. Used 16.6G virtual and 1.1G resident memory. During python phase it used as many as 26 processors at times.

Note: the front page is missing the step identified by sabayonine to add boinc to the docker group usermod -aG docker boinc. The system will need to be rebooted after the usermod command to set the group change.
ID: 5 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
dave

Send message
Joined: 7 Feb 25
Posts: 4
Credit: 807
RAC: 80
Message 6 - Posted: 7 Feb 2025, 15:06:48 UTC - in response to Message 5.  
Last modified: 7 Feb 2025, 15:22:06 UTC

Note: the front page is missing the step identified by sabayonine to add boinc to the docker group usermod -aG docker boinc. The system will need to be rebooted after the usermod command to set the group change.


Thanks, will try again after changing permissions.

Edit: Done that but now no tasks available.
ID: 6 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
entity

Send message
Joined: 6 Feb 25
Posts: 8
Credit: 12,258
RAC: 876
Message 7 - Posted: 7 Feb 2025, 15:34:03 UTC - in response to Message 6.  

On the EPYC server, I have limited the other boinc work to 64 processors (approximately) and created an app_config.xml file for the LLMentorGrid project limiting it to 1 work unit at a time. LLMentorGrid work can have access to the other 64 threads. I'll run with this configuration for a while to see how everything responds and to determine if there is any slow down in the other work.
ID: 7 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
manalog
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Help desk expert

Send message
Joined: 22 Jan 25
Posts: 12
Credit: 56
RAC: 5
Message 9 - Posted: 7 Feb 2025, 18:01:06 UTC - in response to Message 7.  

That's interesting because I hadn't had the possibility to test it on machines with many threads. I will probably choose the plan class "mt" for this application so that it's going to use multithreading, now I think it's kind of cheating because even though it is set as single thread then the docker will use all the available threads.
ID: 9 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Corla99 [Lombardia]
Avatar

Send message
Joined: 6 Feb 25
Posts: 3
Credit: 9,614
RAC: 929
Message 10 - Posted: 7 Feb 2025, 20:06:08 UTC - in response to Message 5.  

In reply to entity's message of 7 Feb 2025:
[...]Note: the front page is missing the step identified by sabayonine to add boinc to the docker group usermod -aG docker boinc. The system will need to be rebooted after the usermod command to set the group change.


You don't need to restart your computer, you can just restart the docker service.
For ubuntu:
sudo systemctl restart boinc-client.service
ID: 10 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Corla99 [Lombardia]
Avatar

Send message
Joined: 6 Feb 25
Posts: 3
Credit: 9,614
RAC: 929
Message 11 - Posted: 7 Feb 2025, 20:18:39 UTC

Tried another batch of wu...and new errors for me.

The host is a dual xeon E5-2699 v3 with ubuntu 24.04.1

The wu run for a while then it exits with an error saying
running docker command: create --name boinc__boinc.llmentor.org_llmentorgrid__batch_19__job_wu1149_0  -v .:/app  -e ARGS="" boinc_geppetto_test_ver_1
Error response from daemon: create .: volume name is too short, names should be at least two alphanumeric characters
command output:
starting container
running docker command: start boinc__boinc.llmentor.org_llmentorgrid__batch_19__job_wu1149_0
Error response from daemon: No such container: boinc__boinc.llmentor.org_llmentorgrid__batch_19__job_wu1149_0
Error: failed to start containers: boinc__boinc.llmentor.org_llmentorgrid__batch_19__job_wu1149_0


Example 1 and Example 2
Also, in the stderr all containers on the host will be shown. Including those not related to the project
ID: 11 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
Corla99 [Lombardia]
Avatar

Send message
Joined: 6 Feb 25
Posts: 3
Credit: 9,614
RAC: 929
Message 12 - Posted: 7 Feb 2025, 21:17:06 UTC - in response to Message 10.  

In reply to Corla99 [Lombardia]'s message of 7 Feb 2025:
You don't need to restart your computer, you can just restart the docker service. For ubuntu:
sudo systemctl restart boinc-client.service


I pasted the wrong command, this is the correct one:

sudo systemctl restart docker.service
ID: 12 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
entity

Send message
Joined: 6 Feb 25
Posts: 8
Credit: 12,258
RAC: 876
Message 13 - Posted: 7 Feb 2025, 21:30:55 UTC - in response to Message 12.  
Last modified: 7 Feb 2025, 21:41:09 UTC

Good to know. Thanks

That makes it quite a bit simpler and less intrusive

Side Note: Doesn't seem to be picked up by WUProp@Home. Probably because it doesn't run under the boinc id but as root in the docker container or because it runs as something WUProp doesn't recognize.
ID: 13 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
STE\/E

Send message
Joined: 7 Feb 25
Posts: 3
Credit: 358,712
RAC: 21,618
Message 14 - Posted: 8 Feb 2025, 1:28:59 UTC - in response to Message 12.  

In reply to Corla99 [Lombardia]'s message of 7 Feb 2025:
In reply to Corla99 [Lombardia]'s message of 7 Feb 2025:
You don't need to restart your computer, you can just restart the docker service. For ubuntu:
sudo systemctl restart boinc-client.service


I pasted the wrong command, this is the correct one:

sudo systemctl restart docker.service


Tried that still getting all errors, even rebooted tried it again but still errors ...

sudo systemctl restart docker.service

steve@PBOYZTOY02L:~$ sudo systemctl restart docker.service
[sudo] password for steve:
Failed to restart docker.service: Unit docker.service not found.
ID: 14 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mmonnin

Send message
Joined: 8 Feb 25
Posts: 11
Credit: 506,452
RAC: 41,992
Message 15 - Posted: 8 Feb 2025, 2:29:39 UTC - in response to Message 14.  
Last modified: 8 Feb 2025, 2:30:23 UTC

When trying to set usermod I get this on several PCs.
usermod: group 'docker' does not exist
ID: 15 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
entity

Send message
Joined: 6 Feb 25
Posts: 8
Credit: 12,258
RAC: 876
Message 16 - Posted: 8 Feb 2025, 3:22:25 UTC - in response to Message 14.  

In reply to STE\/E's message of 8 Feb 2025:
In reply to Corla99 [Lombardia]'s message of 7 Feb 2025:
In reply to Corla99 [Lombardia]'s message of 7 Feb 2025:
You don't need to restart your computer, you can just restart the docker service. For ubuntu:
sudo systemctl restart boinc-client.service


I pasted the wrong command, this is the correct one:

sudo systemctl restart docker.service


Tried that still getting all errors, even rebooted tried it again but still errors ...

sudo systemctl restart docker.service

steve@PBOYZTOY02L:~$ sudo systemctl restart docker.service
[sudo] password for steve:
Failed to restart docker.service: Unit docker.service not found.

Was the docker installation successful? The systemd service is installed as part of the docker install.
ID: 16 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
entity

Send message
Joined: 6 Feb 25
Posts: 8
Credit: 12,258
RAC: 876
Message 17 - Posted: 8 Feb 2025, 3:23:17 UTC - in response to Message 15.  

In reply to mmonnin's message of 8 Feb 2025:
When trying to set usermod I get this on several PCs.
usermod: group 'docker' does not exist

Was the docker installation successful? The docker group is created by the docker install process.
ID: 17 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
STE\/E

Send message
Joined: 7 Feb 25
Posts: 3
Credit: 358,712
RAC: 21,618
Message 20 - Posted: 8 Feb 2025, 10:55:42 UTC

I did the Docker installation procedure from the front page the only command that actually worked was the 1'st command "sudo apt-get install docker.io" & I don't get the Docker not installed error. The other commands were rejected, still get error's though even with Docker installed, I did the restart docker command, rebooted & tried the start Docker command again but still just errors ... I stopped work on all my Linux box's but 1 until this get sorted out ...
ID: 20 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
fzs600

Send message
Joined: 8 Feb 25
Posts: 3
Credit: 265,425
RAC: 14,480
Message 21 - Posted: 8 Feb 2025, 14:00:17 UTC - in response to Message 20.  

error.
<core_client_version>7.24.1</core_client_version>
<![CDATA[
<message>
process exited with code 1 (0x1, -255)</message>
<stderr_txt>
Raw image name (length 25): 62 6F 69 6E 63 5F 67 65 70 70 65 74 74 6F 5F 74 65 73 74 5F 76 65 72 5F 32
Image name: boinc_geppetto_test_ver_2
Utilizzo nome immagine boinc_geppetto_test_ver_2permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Get "http://%2Fvar%2Frun%2Fdocker.sock/v1.45/containers/json?all=1&filters=%7B%22name%22%3A%7B%22boinc__boinc.llmentor.org_llmentorgrid__geppetto-hf-inference_153100_1739020274.570612_15_0%22%3Atrue%7D%7D": dial unix /var/run/docker.sock: connect: permission denied
permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Head "http://%2Fvar%2Frun%2Fdocker.sock/_ping": dial unix /var/run/docker.sock: connect: permission denied
The image boinc_geppetto_test_ver_2 does not existDEPRECATED: The legacy builder is deprecated and will be removed in a future release.
Install the buildx component to build images with BuildKit:
https://docs.docker.com/go/buildx/

permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Post "http://%2Fvar%2Frun%2Fdocker.sock/v1.45/build?buildargs=%7B%7D&cachefrom=%5B%5D&cgroupparent=&cpuperiod=0&cpuquota=0&cpusetcpus=&cpusetmems=&cpushares=0&dockerfile=Dockerfile&labels=%7B%7D&memory=0&memswap=0&networkmode=default&rm=1&shmsize=0&t=boinc_geppetto_test_ver_2&target=&ulimits=%5B%5D&version=1": dial unix /var/run/docker.sock: connect: permission denied
permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Head "http://%2Fvar%2Frun%2Fdocker.sock/_ping": dial unix /var/run/docker.sock: connect: permission denied
permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Post "http://%2Fvar%2Frun%2Fdocker.sock/v1.45/containers/boinc__boinc.llmentor.org_llmentorgrid__geppetto-hf-inference_153100_1739020274.570612_15_0/start": dial unix /var/run/docker.sock: connect: permission denied
Error: failed to start containers: boinc__boinc.llmentor.org_llmentorgrid__geppetto-hf-inference_153100_1739020274.570612_15_0
permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Get "http://%2Fvar%2Frun%2Fdocker.sock/v1.45/containers/json?all=1&filters=%7B%22name%22%3A%7B%22boinc__boinc.llmentor.org_llmentorgrid__geppetto-hf-inference_153100_1739020274.570612_15_0%22%3Atrue%7D%7D": dial unix /var/run/docker.sock: connect: permission denied
permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Get "http://%2Fvar%2Frun%2Fdocker.sock/v1.45/containers/boinc__boinc.llmentor.org_llmentorgrid__geppetto-hf-inference_153100_1739020274.570612_15_0/json": dial unix /var/run/docker.sock: connect: permission denied
stderr from container:
stderr end
permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Post "http://%2Fvar%2Frun%2Fdocker.sock/v1.45/containers/boinc__boinc.llmentor.org_llmentorgrid__geppetto-hf-inference_153100_1739020274.570612_15_0/stop": dial unix /var/run/docker.sock: connect: permission denied
permission denied while trying to connect to the Docker daemon socket at unix:///var/run/docker.sock: Delete "http://%2Fvar%2Frun%2Fdocker.sock/v1.45/containers/boinc__boinc.llmentor.org_llmentorgrid__geppetto-hf-inference_153100_1739020274.570612_15_0": dial unix /var/run/docker.sock: connect: permission denied
2025-02-08 14:56:51 (5300): called boinc_finish(1)

</stderr_txt>
]]>
ID: 21 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
manalog
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Help desk expert

Send message
Joined: 22 Jan 25
Posts: 12
Credit: 56
RAC: 5
Message 22 - Posted: 8 Feb 2025, 14:15:40 UTC - in response to Message 21.  

Hi fzs600, this looks like the "classical" permission error.
Before being able to run docker workunits using boinc, you need to give permissions to the user boinc to use docker.
This can be done with the command:
usermod -aG docker boinc

and, if it still doesn't work,
usermod -aG docker <username>
where username it's your username (it depends on how boinc is installed, but the first case should be much more popular.

I saw that you just get a "geppetto-hf-inference" workunits: these are new workunits that I am still testing, probably in some hours I will put a news item in the home page about this new app.
ID: 22 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
mikey
Avatar

Send message
Joined: 7 Feb 25
Posts: 8
Credit: 33,666
RAC: 2,909
Message 25 - Posted: 8 Feb 2025, 14:41:02 UTC - in response to Message 22.  

When I do the Docker.io install I get these error messages:

W: Failed to fetch http://archive.ubuntu.com/ubuntu/pool/main/c/containerd-app/containerd_1.7.12-0ubuntu2~20.04.1_amd64.deb
404 Not Found [IP: 185.125.190.82 80]


W: Failed to fetch http://archive.ubuntu.com/ubuntu/pool/universe/d/docker.io-app/docker.io_24.0.7-0ubuntu2~20.04.1_amd64.deb
404 Not Found [IP: 185.125.190.82 80]

mikey
ID: 25 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
manalog
Volunteer moderator
Project administrator
Project developer
Project tester
Volunteer developer
Volunteer tester
Project scientist
Help desk expert

Send message
Joined: 22 Jan 25
Posts: 12
Credit: 56
RAC: 5
Message 26 - Posted: 8 Feb 2025, 14:45:50 UTC - in response to Message 25.  

This looks like a problem related with your Ubuntu distribution: perhaps it's too old (20.04) and Docker cannot be found into the archives.
Did you try to
sudo apt-get update
before?

Otherwise, you can use the official installation procedure from the Docker webpage: https://docs.docker.com/engine/install/ubuntu/:
# Add Docker's official GPG key:
sudo apt-get update
sudo apt-get install ca-certificates curl
sudo install -m 0755 -d /etc/apt/keyrings
sudo curl -fsSL https://download.docker.com/linux/ubuntu/gpg -o /etc/apt/keyrings/docker.asc
sudo chmod a+r /etc/apt/keyrings/docker.asc

# Add the repository to Apt sources:
echo \
  "deb [arch=$(dpkg --print-architecture) signed-by=/etc/apt/keyrings/docker.asc] https://download.docker.com/linux/ubuntu \
  $(. /etc/os-release && echo "${UBUNTU_CODENAME:-$VERSION_CODENAME}") stable" | \
  sudo tee /etc/apt/sources.list.d/docker.list > /dev/null
sudo apt-get update

 sudo apt-get install docker-ce docker-ce-cli containerd.io docker-buildx-plugin docker-compose-plugin


However, I suspect that the wrapper is not going to run on Ubuntu 20.04: now it is compiled with a new version of GLIBC and probably will refuse too run on such an old distribution. Solving this problem is on my list, I need to recompile the wrapper so that it can runs even on older distribution.
ID: 26 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
fzs600

Send message
Joined: 8 Feb 25
Posts: 3
Credit: 265,425
RAC: 14,480
Message 27 - Posted: 8 Feb 2025, 15:17:30 UTC - in response to Message 22.  

In reply to manalog's message of 8 Feb 2025:
Hi fzs600, this looks like the "classical" permission error.
Before being able to run docker workunits using boinc, you need to give permissions to the user boinc to use docker.
This can be done with the command:
usermod -aG docker boinc

and, if it still doesn't work,
usermod -aG docker <username>
where username it's your username (it depends on how boinc is installed, but the first case should be much more popular.

I saw that you just get a "geppetto-hf-inference" workunits: these are new workunits that I am still testing, probably in some hours I will put a news item in the home page about this new app.

ok
8 Feb 2025, 15:09:18 UTC 8 Feb 2025, 15:14:59 UTC Terminé et validé 47.87 73.07 1.89 geppetto-hf-inference v1.00 (mt)
x86_64-pc-linux-gnu
ID: 27 · Rating: 0 · rate: Rate + / Rate - Report as offensive     Reply Quote
1 · 2 · Next

Message boards : Number crunching : Geppetto Test 001 - Segnalazioni e Commenti

©2025 Matteo Rinaldi