Message boards : Number crunching : Rosetta Beta 6.00
Author | Message |
---|---|
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 9,591 |
All errors. And, obviously, these work units didn't pass through Ralph@home... |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 9,591 |
Up to now, over 180 WUs (new7snme) without errors. Well done, guys. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1994 Credit: 9,623,704 RAC: 9,591 |
I think the 6.xx branch of the code does everything the 4.xx branch does, plus more. But it has been in beta since the beginning of April. Why not abandon the 4.xx branch (whose code is over 3 years old)? |
Gray Handcock Send message Joined: 26 Sep 05 Posts: 20 Credit: 2,018,415 RAC: 0 |
Hi, my computer specs: Intel(R) Core(TM) i7-4790 CPU @ 3.60GHz [Family 6 Model 60 Stepping 3]; Number of processors: 8; Operating System: Debian GNU/Linux 12 (bookworm) [6.1.0-15-amd64|libc 2.36]; BOINC version: 7.20.5; Memory: 7833.52 MB. Rosetta v4.20 x86_64-pc-linux-gnu tasks are completing normally, as expected. Rosetta Beta v6.05 x86_64-pc-linux-gnu tasks error out: every single one gives "Error while computing". Is there any option to block the "beta 6*" series until this is sorted out? Thanks |
Gray Handcock Send message Joined: 26 Sep 05 Posts: 20 Credit: 2,018,415 RAC: 0 |
Hi UPDATE: currently the errors are at 116 and counting. Just for the record, this is a headless install of Debian stable and no overclocking applied to the hardware, which has been set up just for Rosetta & WCG. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2124 Credit: 41,228,659 RAC: 10,982 |
Hi Sadly not. On the plus side they're erroring out immediately, so no processing time is wasted - just bandwidth in downloading and returning them. Tasks that error out go to other users so they're not wasted, though that's no consolation to you. Sorry |
Jean-David Beyer Send message Joined: 2 Nov 05 Posts: 188 Credit: 6,431,332 RAC: 5,665 |
I do not seem to be having problems with beta tasks. State: All (111) · In progress (19) · Validation pending (0) · Validation inconclusive (0) · Valid (91) · Invalid (0) · Error (1) Application: All (111) · Rosetta (9) · Rosetta Beta (102) · Rosetta Mini (0) · rosetta python projects (0) My machine is: CPU type GenuineIntel Intel(R) Xeon(R) W-2245 CPU @ 3.90GHz [Family 6 Model 85 Stepping 7] Number of processors 16 Operating System Linux Red Hat Enterprise Linux Red Hat Enterprise Linux 8.9 (Ootpa) [4.18.0-513.9.1.el8_9.x86_64|libc 2.28] BOINC version 7.20.2 Memory 128073.86 MB Cache 16896 KB Swap space 15992 MB Total disk space 488.04 GB Free Disk Space 480.6 GB Measured floating point speed 5955.12 million ops/sec Measured integer speed 24244.4 million ops/sec And this is the most recently completed task: 1540123327 1370471503 10 Dec 2023, 18:01:22 UTC 12 Dec 2023, 16:30:58 UTC Completed and validated 28,991.88 28,785.55 488.23 Rosetta Beta v6.05 x86_64-pc-linux-gnu And this is the only error one. 1538969660 1368095704 8 Dec 2023, 4:25:57 UTC 8 Dec 2023, 4:56:41 UTC Cancelled by server 0.00 0.00 --- Rosetta Beta v6.05 x86_64-pc-linux-gnu |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 393 Credit: 12,110,248 RAC: 6,015 |
Hi What is the error that shows in the stderr output? Click on the work unit in the tasks link of your account. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1681 Credit: 17,854,150 RAC: 22,647 |
What is the error that shows in the stderror file - click on the workunit in the tasks link of your account. Here's the output of one Task. <core_client_version>7.20.5</core_client_version> <![CDATA[ <message> process exited with code 127 (0x7f, -129)</message> <stderr_txt> ../../projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.05_x86_64-pc-linux-gnu: error while loading shared libraries: libGL.so.1: cannot open shared object file: No such file or directory </stderr_txt> ]]> Installation/permissions issue? Grant Darwin NT |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 393 Credit: 12,110,248 RAC: 6,015 |
What is the error that shows in the stderror file - click on the workunit in the tasks link of your account. Here's the output of one Task. Certainly looks like it. |
Gray Handcock Send message Joined: 26 Sep 05 Posts: 20 Credit: 2,018,415 RAC: 0 |
Hi - see below for the last 6.05 task, which obviously failed to proceed: <core_client_version>7.20.5</core_client_version> <![CDATA[ <message> process exited with code 127 (0x7f, -129)</message> <stderr_txt> ../../projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.05_x86_64-pc-linux-gnu: error while loading shared libraries: libGL.so.1: cannot open shared object file: No such file or directory </stderr_txt> ]]> Hope it helps to fix this issue. The 4.20 WUs are all fine and validated, and the failed 6.05 WUs are at 118 now. Thanks |
Gray Handcock Send message Joined: 26 Sep 05 Posts: 20 Credit: 2,018,415 RAC: 0 |
Hi - see the below for the last 6.05 - which obviously failed to proceed: I am wondering if not having a GUI installed might be the problem - the box is accessed via ssh, using the command line only. |
Gray Handcock Send message Joined: 26 Sep 05 Posts: 20 Credit: 2,018,415 RAC: 0 |
Hi I have just installed libgl1 (with a bunch of dependencies). I will allow more work and see if that fixes the problem Thanks |
Gray Handcock Send message Joined: 26 Sep 05 Posts: 20 Credit: 2,018,415 RAC: 0 |
Hi So, to update the info for anyone else with this problem: Prior to the "fix" for Rosetta Beta v6.05 x86_64-pc-linux-gnu I had 118 WUs error out of 118 Error message: <stderr_txt> ../../projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.05_x86_64-pc-linux-gnu: error while loading shared libraries: libGL.so.1: cannot open shared object file: No such file or directory </stderr_txt> The fix for my problem would appear to be the installation of libgl1, which also installed the following dependencies: libxcb-present0, libxxf86vm1, libglx-mesa0, libglvnd0, libx11-xcb1, libxshmfence1, libxcb-dri2-0, libxcb-dri3-0, libpciaccess0, libglx0, libdrm-nouveau2, libllvm15, libz3-4, libgl1-mesa-dri, libdrm-common, libxcb-glx0, libglapi-mesa, libdrm-amdgpu1, libdrm-radeon1, libdrm2, libxcb-randr0, libxcb-shm0, libxcb-sync1, libdrm-intel1, libxfixes3, libxcb-xfixes0 I received 11 WUs for Rosetta Beta v6.05 x86_64-pc-linux-gnu yesterday after allowing more work. 6 of them have completed and been validated - the rest are in progress. As a comment it would seem a bit strange to require what would appear to be GUI-related files for CPU work - but the fix does seem to be functioning at this time, so I am happy :) Thanks |
Aurum Send message Joined: 12 Jul 17 Posts: 32 Credit: 38,158,977 RAC: 0 |
I use an APP_CONFIG to run all my projects. I can set to run just 1 beta WU, never tried to set max tasks to 0. <max_concurrent>Zero</max_concurrent> = <max_concurrent>Infinity</max_concurrent> |
Jean-David Beyer Send message Joined: 2 Nov 05 Posts: 188 Credit: 6,431,332 RAC: 5,665 |
I have a bunch of beta 6.05 tasks running, and a bunch of earlier ones have also run. They have one to five hours on them. State: All (107) · In progress (20) · Validation pending (0) · Validation inconclusive (0) · Valid (87) · Invalid (0) · Error (0) Application: All (107) · Rosetta (39) · Rosetta Beta (68) · Rosetta Mini (0) · rosetta python projects (0) |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,169,305 RAC: 3,857 |
I use an APP_CONFIG to run all my projects. I can set to run just 1 beta WU, never tried to set max tasks to 0. I don't know what your settings are, but I'm getting nearly 100% valid tasks here on both my Windows and Linux PCs. State: All (2097) · In progress (576) · Validation pending (0) · Validation inconclusive (0) · Valid (1514) · Invalid (0) · Error (7) Application: All (2097) · Rosetta (102) · Rosetta Beta (1995) · Rosetta Mini (0) · rosetta python projects (0) |
Jean-David Beyer Send message Joined: 2 Nov 05 Posts: 188 Credit: 6,431,332 RAC: 5,665 |
I don't know what your settings are, but I'm getting nearly 100% valid tasks here on both my Windows and Linux PCs. I get the same kind of thing. I get no errors on my fast Linux machine, but I got 5 on my slower Windows 10 machine. State: All (171) · In progress (37) · Validation pending (0) · Validation inconclusive (0) · Valid (129) · Invalid (0) · Error (5) Application: All (171) · Rosetta (50) · Rosetta Beta (121) · Rosetta Mini (0) · rosetta python projects (0) |
The Ancient One Send message Joined: 4 Oct 05 Posts: 11 Credit: 975,605 RAC: 1,788 |
Hi, Microsoft detected the following error: rosetta_beta_6.04_windows_x86_64.exe Description Faulting Application Path: C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta\rosetta_beta_6.04_windows_x86_64.exe Creation Time: 24/12/2023 12:57:52 Problem: Stopped working Status: Report sent Problem signature Problem Event Name: APPCRASH Application Name: rosetta_beta_6.04_windows_x86_64.exe Application Version: 0.0.0.0 Application Timestamp: 650a8b67 Fault Module Name: StackHash_0000 Fault Module Version: 10.0.19041.3636 Fault Module Timestamp: 9b64aa6f Exception Code: c0000374 Exception Offset: PCH_84 Extra information about the problem Bucket ID: 94f0a8d188c87f8f34609c10c9d5462c (1468345074442389036) |
Aurum Send message Joined: 12 Jul 17 Posts: 32 Credit: 38,158,977 RAC: 0 |
Me too. Send more betas. I use an APP_CONFIG to run all my projects. I can set to run just 1 beta WU, never tried to set max tasks to 0. My comment was meant to say that the syntax <max_concurrent>0</max_concurrent> is meaningless to BOINC. |
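
For anyone wanting to try the app_config.xml limit mentioned in the last few posts, here is a minimal Python sketch that writes out such a file's contents. It is not a project-provided tool. The app name `rosetta_beta` is an assumption inferred from the executable name quoted earlier in the thread (BOINC's client_state.xml should show the real short name), and `build_app_config` is a hypothetical helper. It refuses 0 because, per the comment above, 0 is not treated as a limit.

```python
from xml.sax.saxutils import escape


def build_app_config(app_name, max_concurrent):
    """Return app_config.xml text that caps one app at max_concurrent running tasks.

    app_name is an assumption here; use the short name your BOINC client reports.
    """
    if max_concurrent < 1:
        # Per the thread, <max_concurrent>0</max_concurrent> does not block an app.
        raise ValueError("max_concurrent must be 1 or more")
    # One tag per line, which keeps the file easy to read and edit by hand.
    return (
        "<app_config>\n"
        "  <app>\n"
        f"    <name>{escape(app_name)}</name>\n"
        f"    <max_concurrent>{int(max_concurrent)}</max_concurrent>\n"
        "  </app>\n"
        "</app_config>\n"
    )


# "rosetta_beta" is a guess based on the binary name, not a confirmed app name.
config_text = build_app_config("rosetta_beta", 1)
print(config_text)
```

The output would normally be saved as app_config.xml in the project's folder (projects/boinc.bakerlab.org_rosetta, going by the path in the error messages above) and picked up after asking the client to re-read its config files. This only limits how many beta tasks run at once; it does not stop them from being downloaded.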
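
For headless Linux hosts hitting the libGL.so.1 error reported earlier in this thread, a quick check like the one below may help confirm the cause before allowing more beta work. This is a rough Python sketch under stated assumptions: `missing_library` and `can_load` are hypothetical helper names, the pattern only matches the loader message quoted in the thread, and the check simply asks the dynamic loader to open the library by name.

```python
import ctypes
import re

# Matches the loader message quoted in the thread's stderr output.
MISSING_LIB_PATTERN = re.compile(
    r"error while loading shared libraries: (\S+): cannot open shared object file"
)


def missing_library(stderr_text):
    """Return the library name from a 'cannot open shared object file' message, or None."""
    match = MISSING_LIB_PATTERN.search(stderr_text)
    return match.group(1) if match else None


def can_load(soname):
    """Return True if the dynamic loader can open soname on this machine."""
    try:
        ctypes.CDLL(soname)
    except OSError:
        return False
    return True


# The stderr text below is copied from the failed task output posted above.
sample = (
    "../../projects/boinc.bakerlab.org_rosetta/rosetta_beta_6.05_x86_64-pc-linux-gnu: "
    "error while loading shared libraries: libGL.so.1: "
    "cannot open shared object file: No such file or directory"
)
lib = missing_library(sample)
print(lib, "loadable on this host:", can_load(lib))
```

If the library is reported as not loadable, the fix described in the thread for Debian 12 was installing the libgl1 package; other distributions may use a different package name.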
©2024 University of Washington
https://www.bakerlab.org