Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 42 · 43 · 44 · 45 · 46 · 47 · 48 . . . 300 · Next

AuthorMessage
Profile WBT112

Send message
Joined: 11 Dec 05
Posts: 11
Credit: 1,382,693
RAC: 0
Message 94097 - Posted: 10 Apr 2020, 19:34:33 UTC

I can't upload this Task https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1029331860 . It is stuck for two days now. Always uploading to 100% but not disappreaing from the transfer tab.
Restarted BOINC manager etc. without success.
ID: 94097 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1232
Credit: 14,269,631
RAC: 3,846
Message 94101 - Posted: 10 Apr 2020, 20:14:45 UTC - in response to Message 94097.  

I can't upload this Task https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1029331860 . It is stuck for two days now. Always uploading to 100% but not disappreaing from the transfer tab.
Restarted BOINC manager etc. without success.

Your computer(s) are hidden, so I can't tell if I could help.
ID: 94101 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1670
Credit: 17,523,845
RAC: 23,480
Message 94115 - Posted: 10 Apr 2020, 21:36:34 UTC - in response to Message 94097.  
Last modified: 10 Apr 2020, 21:37:54 UTC

I can't upload this Task https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1029331860 . It is stuck for two days now. Always uploading to 100% but not disappreaing from the transfer tab.
Restarted BOINC manager etc. without success.
While the file is uploading (not waiting, but with the elapsed time counting upward), select Activity, "Suspend network activity." Give it a second or 2, then set it back to "Network activity based on preferences." This can often get a stuck transfer unstuck.
Grant
Darwin NT
ID: 94115 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Bryn Mawr

Send message
Joined: 26 Dec 18
Posts: 389
Credit: 12,070,320
RAC: 12,300
Message 94122 - Posted: 10 Apr 2020, 22:32:42 UTC - in response to Message 94115.  

I can't upload this Task https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1029331860 . It is stuck for two days now. Always uploading to 100% but not disappreaing from the transfer tab.
Restarted BOINC manager etc. without success.
While the file is uploading (not waiting, but with the elapsed time counting upward), select Activity, "Suspend network activity." Give it a second or 2, then set it back to "Network activity based on preferences." This can often get a stuck transfer unstuck.


Useful to know, thank you :-)
ID: 94122 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sven

Send message
Joined: 7 Feb 16
Posts: 8
Credit: 222,005
RAC: 0
Message 94144 - Posted: 11 Apr 2020, 9:47:45 UTC
Last modified: 11 Apr 2020, 9:48:50 UTC

Hi,

I'm facing well known problems with crunching Rosetta tasks.
My Boinc client is freshly installed on a new computer and the Rosetta project added shortly afterwards.

Now I receive again and again error messages that taks are exited with zero status and no finish file, see below. I had the same problem on several other computers in the past. So there seems to be a general problem with Rosetta and not with one certain computer. By the way: Resetting the projects is no help.
That way it makes no sense to continue crunching. It would be waste of time and electric power.

Thanks for your reply. And it would be great, if the project it self could be repaired that such issues can't happen anymore.

Sven


****
10.04.2020 13:23:40 | | cc_config.xml not found - using defaults
10.04.2020 13:23:40 | | Starting BOINC client version 7.16.5 for windows_x86_64
10.04.2020 13:23:40 | | Libraries: libcurl/7.47.1 OpenSSL/1.0.2s zlib/1.2.8
10.04.2020 13:23:40 | | Data directory: C:ProgramDataBOINC
10.04.2020 13:23:40 | | Running under account rothsven
10.04.2020 13:23:42 | | CUDA: NVIDIA GPU 0: Quadro M2200 (driver version 378.98, CUDA version 8.0, compute capability 5.2, 4096MB, 3416MB available, 2122 GFLOPS peak)
10.04.2020 13:23:42 | | OpenCL: NVIDIA GPU 0: Quadro M2200 (driver version 378.98, device version OpenCL 1.2 CUDA, 4096MB, 3416MB available, 2122 GFLOPS peak)
10.04.2020 13:23:42 | | OpenCL: Intel GPU 0: Intel(R) HD Graphics 630 (driver version 21.20.16.4574, device version OpenCL 2.1, 13037MB, 13037MB available, 211 GFLOPS peak)
10.04.2020 13:23:42 | | OpenCL CPU: Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz (OpenCL driver vendor: Intel(R) Corporation, driver version 6.8.0.2, device version OpenCL 2.1 (Build 2))
10.04.2020 13:23:42 | | Windows processor group 0: 8 processors
10.04.2020 13:23:42 | | Host name: EAMODLES4PRLNQ2
10.04.2020 13:23:42 | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz [Family 6 Model 158 Stepping 9]
10.04.2020 13:23:42 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 fma cx16 sse4_1 sse4_2 movebe popcnt aes f16c rdrandsyscall nx lm avx avx2 vmx smx tm2 pbe fsgsbase bmi1 hle smep bmi2
10.04.2020 13:23:42 | | OS: Microsoft Windows 10: Enterprise x64 Edition, (10.00.16299.00)
10.04.2020 13:23:42 | | Memory: 31.85 GB physical, 36.60 GB virtual
10.04.2020 13:23:42 | | Disk: 476.03 GB total, 356.30 GB free
10.04.2020 13:23:42 | | Local time is UTC +2 hours
10.04.2020 13:23:42 | | No WSL found.
10.04.2020 13:23:42 | | General prefs: from http://lhcathomeclassic.cern.ch/sixtrack/ (last modified 06-Mar-2015 20:46:54)
10.04.2020 13:23:42 | | Host location: none
10.04.2020 13:23:42 | | General prefs: using your defaults
10.04.2020 13:23:42 | | Reading preferences override file
10.04.2020 13:23:42 | | Preferences:
10.04.2020 13:23:42 | | max memory usage when active: 16305.90 MB
10.04.2020 13:23:42 | | max memory usage when idle: 29350.62 MB
10.04.2020 13:23:43 | | max disk usage: 100.00 GB
10.04.2020 13:23:43 | | max CPUs used: 2
10.04.2020 13:23:43 | | don't compute while active
10.04.2020 13:23:43 | | don't use GPU while active
10.04.2020 13:23:43 | | suspend work if non-BOINC CPU load exceeds 15%
10.04.2020 13:23:43 | | (to change preferences, visit a project web site or select Preferences in the Manager)
10.04.2020 13:23:43 | | Setting up project and slot directories
10.04.2020 13:23:43 | | Checking active tasks
10.04.2020 13:23:43 | climateprediction.net | URL https://climateprediction.net/; Computer ID 1502231; resource share 10
10.04.2020 13:23:43 | Rosetta@home | URL https://boinc.bakerlab.org/rosetta/; Computer ID 4092002; resource share 20
10.04.2020 13:23:43 | | Setting up GUI RPC socket
10.04.2020 13:23:43 | | Checking presence of 14 project files
10.04.2020 13:23:43 | | Suspending network activity - computer is in use
10.04.2020 13:27:04 | | Resuming network activity
10.04.2020 14:22:05 | Rosetta@home | Task hgfp_dimer_5x_373_fold_SAVE_ALL_OUT_907154_43_0 exited with zero status but no 'finished' file
10.04.2020 14:22:05 | Rosetta@home | If this happens repeatedly you may need to reset the project.
10.04.2020 14:50:20 | Rosetta@home | Task hgfp_dimer_5x_373_fold_SAVE_ALL_OUT_907154_43_0 exited with zero status but no 'finished' file
10.04.2020 14:50:20 | Rosetta@home | If this happens repeatedly you may need to reset the project.
10.04.2020 15:21:01 | Rosetta@home | Task hgfp_dimer_5x_373_fold_SAVE_ALL_OUT_907154_43_0 exited with zero status but no 'finished' file
10.04.2020 15:21:01 | Rosetta@home | If this happens repeatedly you may need to reset the project.
10.04.2020 16:35:03 | Rosetta@home | Task hgfp_dimer_5x_373_fold_SAVE_ALL_OUT_907154_43_0 exited with zero status but no 'finished' file
10.04.2020 16:35:03 | Rosetta@home | If this happens repeatedly you may need to reset the project.
10.04.2020 17:31:41 | Rosetta@home | Project requested delay of 7 seconds
10.04.2020 19:23:06 | | Suspending network activity - computer is in use
10.04.2020 19:26:47 | | Resuming network activity
10.04.2020 23:04:58 | | Suspending network activity - computer is in use
10.04.2020 23:09:18 | | Resuming network activity
11.04.2020 02:25:29 | climateprediction.net | No tasks sent
11.04.2020 02:25:29 | climateprediction.net | Project requested delay of 3636 seconds
11.04.2020 02:25:36 | Rosetta@home | Project requested delay of 7 seconds
11.04.2020 02:31:24 | Rosetta@home | No tasks sent
11.04.2020 02:31:24 | Rosetta@home | Project requested delay of 7 seconds
11.04.2020 02:31:36 | Rosetta@home | Project requested delay of 7 seconds
11.04.2020 02:44:52 | Rosetta@home | Task hgfp_dimer_5x_221_fold_SAVE_ALL_OUT_906873_888_0 exited with zero status but no 'finished' file
11.04.2020 02:44:52 | Rosetta@home | If this happens repeatedly you may need to reset the project.
11.04.2020 03:24:33 | Rosetta@home | Task hgfp_monomer_54_fold_SAVE_ALL_OUT_906079_885_0 exited with zero status but no 'finished' file
11.04.2020 03:24:33 | Rosetta@home | If this happens repeatedly you may need to reset the project.
11.04.2020 04:14:30 | Rosetta@home | Task hgfp_monomer_54_fold_SAVE_ALL_OUT_906079_885_0 exited with zero status but no 'finished' file
11.04.2020 04:14:30 | Rosetta@home | If this happens repeatedly you may need to reset the project.
11.04.2020 06:42:26 | Rosetta@home | Task hgfp_dimer_5x_221_fold_SAVE_ALL_OUT_906873_888_0 exited with zero status but no 'finished' file
11.04.2020 06:42:26 | Rosetta@home | If this happens repeatedly you may need to reset the project.
11.04.2020 06:45:33 | Rosetta@home | Task hgfp_monomer_54_fold_SAVE_ALL_OUT_906079_885_0 exited with zero status but no 'finished' file
11.04.2020 06:45:33 | Rosetta@home | If this happens repeatedly you may need to reset the project.
11.04.2020 08:24:54 | Rosetta@home | Task hgfp_monomer_54_fold_SAVE_ALL_OUT_906079_885_0 exited with zero status but no 'finished' file
11.04.2020 08:24:54 | Rosetta@home | If this happens repeatedly you may need to reset the project.
11.04.2020 09:53:15 | Rosetta@home | Task hgfp_monomer_54_fold_SAVE_ALL_OUT_906079_885_0 exited with zero status but no 'finished' file
11.04.2020 09:53:15 | Rosetta@home | If this happens repeatedly you may need to reset the project.
11.04.2020 11:33:00 | | Suspending network activity - computer is in use
*******
ID: 94144 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 94146 - Posted: 11 Apr 2020, 11:00:47 UTC - in response to Message 94144.  

Now I receive again and again error messages that taks are exited with zero status and no finish file, see below.

I don't see it on any of my machines (nine Ubuntu and one Windows 7 64-bit).
It could be your anti-virus interfering with creating or accessing the file.
ID: 94146 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile WBT112

Send message
Joined: 11 Dec 05
Posts: 11
Credit: 1,382,693
RAC: 0
Message 94147 - Posted: 11 Apr 2020, 11:02:42 UTC - in response to Message 94115.  

I can't upload this Task https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1029331860 . It is stuck for two days now. Always uploading to 100% but not disappreaing from the transfer tab.
Restarted BOINC manager etc. without success.
While the file is uploading (not waiting, but with the elapsed time counting upward), select Activity, "Suspend network activity." Give it a second or 2, then set it back to "Network activity based on preferences." This can often get a stuck transfer unstuck.


I tried this however without success.
BOINC however gives me an error:

11.04.2020 13:00:13 | Rosetta@home | Started upload of conducting_fiber_fold_21_fold_SAVE_ALL_OUT_905803_166_0_r421462194_0
11.04.2020 13:00:41 | Rosetta@home | Temporarily failed upload of conducting_fiber_fold_21_fold_SAVE_ALL_OUT_905803_166_0_r421462194_0: transient HTTP error
11.04.2020 13:00:41 | Rosetta@home | Backing off 04:04:44 on upload of conducting_fiber_fold_21_fold_SAVE_ALL_OUT_905803_166_0_r421462194_0
11.04.2020 13:00:42 |  | Project communication failed: attempting access to reference site
11.04.2020 13:00:43 |  | Internet access OK - project servers may be temporarily down.

ID: 94147 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1670
Credit: 17,523,845
RAC: 23,480
Message 94150 - Posted: 11 Apr 2020, 11:18:30 UTC - in response to Message 94144.  

Hi,

I'm facing well known problems with crunching Rosetta tasks.
My Boinc client is freshly installed on a new computer and the Rosetta project added shortly afterwards.

Now I receive again and again error messages that taks are exited with zero status and no finish file, see below. I had the same problem on several other computers in the past. So there seems to be a general problem with Rosetta and not with one certain computer. By the way: Resetting the projects is no help.
But it's not an issue that other people are seeing, so your settings are very likely to be a factor.

I would strongly suggest changing
10.04.2020 13:23:43 | | don't compute while active
to allow processing,
and change
10.04.2020 13:23:43 | | suspend work if non-BOINC CPU load exceeds 15%
and leave that blank.

              Suspend when computer is in use (leave un-checked)
Suspend GPU computing when computer is in use	
   'In use' means mouse/keyboard input in last 3 minutes
  Suspend when no mouse/keyboard input in last --- minutes
     Suspend when non-BOINC CPU usage is above --- %
You have allocated only 2 CPU threads out of 8 to process BOINC work, so there shouldn't be any benefit to stopping BOINC from processing work when the computer is in use, or non BOINC CPU usage is high (Rosetta Applications run at Idle priority).

See if the errors no longer occur with those settings.
Grant
Darwin NT
ID: 94150 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1670
Credit: 17,523,845
RAC: 23,480
Message 94151 - Posted: 11 Apr 2020, 11:24:15 UTC - in response to Message 94147.  
Last modified: 11 Apr 2020, 11:24:42 UTC

I can't upload this Task https://boinc.bakerlab.org/rosetta/workunit.php?wuid=1029331860 . It is stuck for two days now. Always uploading to 100% but not disappreaing from the transfer tab.
Restarted BOINC manager etc. without success.
While the file is uploading (not waiting, but with the elapsed time counting upward), select Activity, "Suspend network activity." Give it a second or 2, then set it back to "Network activity based on preferences." This can often get a stuck transfer unstuck.


I tried this however without success.
BOINC however gives me an error:

11.04.2020 13:00:42 |  | Project communication failed: attempting access to reference site
11.04.2020 13:00:43 |  | Internet access OK - project servers may be temporarily down.

That's showing it's not able to contact the Rosettta servers.

If on the Project tab with Rosetta selected you click update, what result do you get in the Event log? (you haven't updated any AV/Malware software recently, or installed a new programme?)
Grant
Darwin NT
ID: 94151 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Tom M

Send message
Joined: 20 Jun 17
Posts: 87
Credit: 14,880,624
RAC: 117,108
Message 94154 - Posted: 11 Apr 2020, 13:21:08 UTC - in response to Message 94144.  

Now I receive again and again error messages that taks are exited with zero status and no finish file, see below.


Are you running the available cpu/threads at 90% or less? You often need at least 1 thread "idle" to keep from over-committing your cpu which can produce that symptom.

Tom
Help, my tagline is missing..... Help, my tagline is......... Help, m........ Hel.....
ID: 94154 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
EHM-1
Avatar

Send message
Joined: 21 Mar 20
Posts: 23
Credit: 183,782
RAC: 0
Message 94155 - Posted: 11 Apr 2020, 13:23:04 UTC - in response to Message 94072.  
Last modified: 11 Apr 2020, 13:24:58 UTC

FYI to anyone who may be paying attention: Rosetta resumed apparently normal behavior on my desktop this morning, after around 2 days of appearing stalled. I have no idea what is causing this behavior. Any ideas? Original post https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=92534#92534.



Eric
ID: 94155 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sven

Send message
Joined: 7 Feb 16
Posts: 8
Credit: 222,005
RAC: 0
Message 94161 - Posted: 11 Apr 2020, 15:42:51 UTC - in response to Message 94151.  

Jim, Grant,

I don't see any problems with any other projects with these settings. And as I said, I tried it on several computers. Only Rosetta frequently stops crunching tasks.
The low number of cpu kernels is for reducing the fan speed, which can reach a nerving sound.

But to find out, if any of my settings are the problem, i try and change the settings to more power consumption.
ID: 94161 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
svincent

Send message
Joined: 30 Dec 05
Posts: 219
Credit: 12,120,035
RAC: 0
Message 94165 - Posted: 11 Apr 2020, 17:00:33 UTC

Problem with this task on Ubuntu

https://boinc.bakerlab.org/rosetta/result.php?resultid=1146197010

4mc4in7o_Mini_Protein_binds_IL1R_COVID-19_design4_SAVE_ALL_OUT_905389_4_1

Validate error after a couple of minutes

ERROR: [ERROR] Unable to open constraints file: mot_HHH_b1_05627_000000248_0001_1_19_H_._28a7dda1a33635c05f5ab621834c8d3e_0001_0001_0001.MSAcst
ERROR:: Exit from: src/core/scoring/constraints/ConstraintIO.cc line: 457
00:51:21 (5035): called boinc_finish(0)
ID: 94165 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1232
Credit: 14,269,631
RAC: 3,846
Message 94166 - Posted: 11 Apr 2020, 17:55:12 UTC - in response to Message 94165.  
Last modified: 11 Apr 2020, 17:58:13 UTC

svincent, that error often means that some overaggressive antivirus program somewhere on the path from the download server to your computer prevented successful downloading of the missing file.

A less likely cause is the workunit being set up to use a file that wasn't on the download server.

Does the BOINC log file say anything about attempts to download that file? That file is emptied when BOINC restarts, so it might already be too late to see them.

Do you have any antivirus program running on that computer? If so, does it keep a list of what files it decided to delete or hide?

It doesn't have to have been blocked on your computer. Some previous occurrences of that type of error were on servers on the links that connect the R@h download server to their main internet portal.
ID: 94166 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
svincent

Send message
Joined: 30 Dec 05
Posts: 219
Credit: 12,120,035
RAC: 0
Message 94179 - Posted: 11 Apr 2020, 21:42:59 UTC - in response to Message 94166.  

I have no antivirus software on my computer.

Here's a portion of the event log that I hope may be helpful

Sat 11 Apr 2020 12:49:03 AM PDT | Rosetta@home | Starting task 4mc4in7o_Mini_Protein_binds_IL1R_COVID-19_design4_SAVE_ALL_OUT_905389_4_1
Sat 11 Apr 2020 12:49:04 AM PDT | Rosetta@home | Started upload of hgfp_dimer_3x_42_fold_SAVE_ALL_OUT_906263_790_0_r1313969874_0
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Too old connection (2574 seconds), disconnect it
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Connection 2712 seems to be dead!
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Closing connection 2712
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Too old connection (2574 seconds), disconnect it
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Connection 2713 seems to be dead!
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Closing connection 2713
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Trying 128.95.160.157:80...
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: TCP_NODELAY set
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Connected to boinc.bakerlab.org (128.95.160.157) port 80 (#2714)
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: POST /rosetta_cgi/file_upload_handler HTTP/1.1
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Host: boinc.bakerlab.org
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: User-Agent: BOINC client (x86_64-pc-linux-gnu 7.16.3)
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Accept: */*
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Accept-Encoding: deflate, gzip
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Accept-Language: en_CA
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Content-Length: 314
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Content-Type: application/x-www-form-urlencoded
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server:
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: We are completely uploaded and fine
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Mark bundle as not supporting multiuse
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: HTTP/1.1 200 OK
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Date: Sat, 11 Apr 2020 07:49:05 GMT
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Server: Apache/2.4.18
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Vary: Accept-Encoding
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Content-Encoding: gzip
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Content-Length: 75
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Content-Type: text/plain
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server:
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home |
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Connection #2714 to host boinc.bakerlab.org left intact
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Info: Found bundle for host boinc.bakerlab.org: 0x7f9cb4001520 [serially]
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Info: Can not multiplex, even if we wanted to!
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Info: Re-using existing connection! (#2714) with host boinc.bakerlab.org
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Info: Connected to boinc.bakerlab.org (128.95.160.157) port 80 (#2714)
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: POST /rosetta_cgi/file_upload_handler HTTP/1.1
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Host: boinc.bakerlab.org
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: User-Agent: BOINC client (x86_64-pc-linux-gnu 7.16.3)
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Accept: */*
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Accept-Encoding: deflate, gzip
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Accept-Language: en_CA
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Content-Length: 283230
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Content-Type: application/x-www-form-urlencoded
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server: Expect: 100-continue
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Sent header to server:
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Info: Mark bundle as not supporting multiuse
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: HTTP/1.1 100 Continue
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Info: We are completely uploaded and fine
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Info: Mark bundle as not supporting multiuse
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: HTTP/1.1 200 OK
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Date: Sat, 11 Apr 2020 07:49:07 GMT
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Server: Apache/2.4.18
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Content-Length: 64
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: Content-Type: text/plain
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server:
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: <data_server_reply>
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: <status>0</status>
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Received header from server: </data_server_reply>
Sat 11 Apr 2020 12:49:07 AM PDT | Rosetta@home | [http] [ID#4338] Info: Connection #2714 to host boinc.bakerlab.org left intact
Sat 11 Apr 2020 12:49:08 AM PDT | Rosetta@home | Finished upload of hgfp_dimer_3x_42_fold_SAVE_ALL_OUT_906263_790_0_r1313969874_0
Sat 11 Apr 2020 12:51:24 AM PDT | Rosetta@home | Computation for task 4mc4in7o_Mini_Protein_binds_IL1R_COVID-19_design4_SAVE_ALL_OUT_905389_4_1 finished
ID: 94179 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1670
Credit: 17,523,845
RAC: 23,480
Message 94184 - Posted: 11 Apr 2020, 22:44:22 UTC - in response to Message 94161.  

Jim, Grant,

I don't see any problems with any other projects with these settings. And as I said, I tried it on several computers. Only Rosetta frequently stops crunching tasks.
That's our point- you are having these issues, on multiple systems, yet other people aren't.
So the most likely cause is what is different between what you have and everyone else has? And that would appear to be your settings.


The low number of cpu kernels is for reducing the fan speed, which can reach a nerving sound.
Yep, so because you have limited the number of Tasks that will run, there is no need to start & stop computation work when other programmes are running.


But to find out, if any of my settings are the problem, i try and change the settings to more power consumption.
Hopefully it will sort things out.
Grant
Darwin NT
ID: 94184 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2115
Credit: 41,112,600
RAC: 19,835
Message 94187 - Posted: 12 Apr 2020, 0:31:21 UTC - in response to Message 94161.  

I don't see any problems with any other projects with these settings.

Just because Rosetta runs on the Boinc platform, like other projects, there's no requirement for it to run under the same parameters as other projects. Indeed, it's <very> different.

Rosetta imposes very high demands on RAM, disk space and CPU power.
It needs a relatively long processing time (compared to some projects, but lower than some others) but also requires a relatively short turnaround time.
Consequently, the buffer of tasks it allow users to hold offline are lower than you may be used to.

If people insist on maintaining the assumptions that apply to entirely different projects, those assumptions are going to fall flat on their face here and problems are inevitable.

So be prepared to adapt
ID: 94187 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1232
Credit: 14,269,631
RAC: 3,846
Message 94188 - Posted: 12 Apr 2020, 0:40:21 UTC - in response to Message 94179.  

I have no antivirus software on my computer.

Here's a portion of the event log that I hope may be helpful

Sat 11 Apr 2020 12:49:03 AM PDT | Rosetta@home | Starting task 4mc4in7o_Mini_Protein_binds_IL1R_COVID-19_design4_SAVE_ALL_OUT_905389_4_1
Sat 11 Apr 2020 12:49:04 AM PDT | Rosetta@home | Started upload of hgfp_dimer_3x_42_fold_SAVE_ALL_OUT_906263_790_0_r1313969874_0
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Too old connection (2574 seconds), disconnect it
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Connection 2712 seems to be dead!
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Closing connection 2712
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Too old connection (2574 seconds), disconnect it
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Connection 2713 seems to be dead!
Sat 11 Apr 2020 12:49:05 AM PDT | Rosetta@home | [http] [ID#4338] Info: Closing connection 2713

[snip]

Looks like you found log file information for an upload.

You need to look for information on downloads instead, which for the same task, should be earlier in the log file than the upload.

As I mentioned before, the cause is not necessarily on your computer.
ID: 94188 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
svincent

Send message
Joined: 30 Dec 05
Posts: 219
Credit: 12,120,035
RAC: 0
Message 94247 - Posted: 12 Apr 2020, 14:53:48 UTC - in response to Message 94188.  

Sorry: I don't seem to be able to find that info: the log didn't go back that far. Anyway its the only task where it happened.
ID: 94247 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Evil Penguin
Avatar

Send message
Joined: 10 Jun 08
Posts: 5
Credit: 10,168,989
RAC: 0
Message 94255 - Posted: 12 Apr 2020, 17:12:31 UTC

Could someone please help me determine if the system I'm running is to blame or if these are bad WUs?
https://boinc.bakerlab.org/rosetta/result.php?resultid=1147046920

https://boinc.bakerlab.org/rosetta/show_host_detail.php?hostid=3817017

Only thing out of spec is how fast I have the memory running.
I ran MemTest Pro for a good 24 hours and no errors.
ID: 94255 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 42 · 43 · 44 · 45 · 46 · 47 · 48 . . . 300 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org