Problems and Technical Issues with Rosetta@home

Author	Message
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1862 Credit: 18,534,891 RAC: 0	Message 102749 - Posted: 20 Sep 2021, 5:09:51 UTC - in response to Message 102728. How can I make it stop? I have been a long time supporter of RH, but will disconnect from the project if it keeps wasting my bandwidth with 2GB failed downloads. Temporarily STOP REQUESTING NEW WORK FOR THE PROJECT by setting NO NEW WORK for Rosetta on your PC(s) in BOINC until the project fixes the problem. That means you will get no work at all, even though there are just under 13 million Rosetta 4.20 Tasks available. Grant Darwin NT ID: 102749 · Rating: 0 · rate: / Reply Quote
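For anyone who prefers the command line to the BOINC Manager button, the "no new work" setting above can probably be applied with `boinccmd`. The sketch below only builds and prints the command; it does not run it. The project URL is an assumption inferred from the apps page link quoted later in this thread, so check it against what your own client shows, and the helper name `no_new_work_command` is made up for illustration.

```python
# Assumption: project URL inferred from the apps page link in this thread.
ROSETTA_URL = "https://boinc.bakerlab.org/rosetta/"


def no_new_work_command(project_url, allow=False):
    """Build the boinccmd argument list that stops (or resumes) work fetch.

    "nomorework" and "allowmorework" are the boinccmd project operations
    for toggling the No New Tasks state of a single project.
    """
    operation = "allowmorework" if allow else "nomorework"
    return ["boinccmd", "--project", project_url, operation]


# Print the command instead of running it, so nothing changes by accident.
print(" ".join(no_new_work_command(ROSETTA_URL)))
```

Running the printed command on the machine where the client lives should have the same effect as pressing "No new tasks" in the Manager; passing `allow=True` builds the command that turns work fetch back on.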

Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1862 Credit: 18,534,891 RAC: 0	Message 102750 - Posted: 20 Sep 2021, 5:16:20 UTC - in response to Message 102745. I received an odd message today: Rosetta@home: Notice from server VirtualBox is not installed 9/19/2021 6:01:50 PM It is correct that I do not have VirtualBox installed, and I have not used VirtualBox to run BOINC programs. That was sorted out yesterday. It started on the 18th & when it did, it stopped those without VirtualBox from getting any Rosetta work- even that which doesn't need VirtualBox. Once they fixed it, there were no more notices about a lack of VirtualBox in Event logs, and work started flowing again- as can be seen in the Graph below. In particular, the big dip on the 18th. Grant Darwin NT ID: 102750 · Rating: 0 · rate: / Reply Quote

Admin Project administrator Send message Joined: 1 Jul 05 Posts: 5145 Credit: 0 RAC: 0	Message 102761 - Posted: 20 Sep 2021, 17:58:33 UTC There is a researcher in the group that has submitted a large batch of VM jobs that requires Virtual Box. More information is provided at https://boinc.berkeley.edu/wiki/VirtualBox. This is the first large batch of "rosetta_python_projects" VM jobs but we hope to have more in the future. Since this is our first attempt at using BOINC's VM option, we will undoubtedly run into issues as we scale up. Please be patient as we try to resolve these issues. Our standard rosetta application is still being used but the number of available jobs can fluctuate as it has before depending on the computing demand from our researchers. The VM app feature allows researchers much more flexibility to run scientific software that otherwise would not be possible on R@h. For example, the "rosetta_python_projects" app allows researchers to run deep learning based structure prediction pipelines including RoseTTAFold and AlphaFold2, and other python based software packages like AIMNet which is being used in the current batch of VM jobs to minimize potential macrocycle therapeutics. ID: 102761 · Rating: 0 · rate: / Reply Quote

Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0	Message 102763 - Posted: 20 Sep 2021, 20:06:43 UTC - in response to Message 102761. This is the first large batch of "rosetta_python_projects" VM jobs but we hope to have more in the future. Very good. I have several machines with lots of memory waiting to do them. BUT: You need to make them selectable from the regular Rosettas. A simple checkbox would do, but even better would be the ability to select the number downloaded (or running) at a time. See LHC, or Cosmology, or WCG for examples. ID: 102763 · Rating: 0 · rate: / Reply Quote

Ole Pettersen Send message Joined: 4 Dec 10 Posts: 7 Credit: 12,894,634 RAC: 5,254	Message 102765 - Posted: 21 Sep 2021, 0:15:46 UTC - in response to Message 102745. It is just an informational message, hoping that you will install VBox to crunch more tasks. Thanks. ID: 102765 · Rating: 0 · rate: / Reply Quote

Sid Celery Send message Joined: 11 Feb 08 Posts: 2386 Credit: 45,704,921 RAC: 23,612	Message 102768 - Posted: 21 Sep 2021, 1:19:33 UTC - in response to Message 102761. There is a researcher in the group that has submitted a large batch of VM jobs that requires Virtual Box. More information is provided at https://boinc.berkeley.edu/wiki/VirtualBox Virtualization and BOINC BOINC uses virtualization to allow scientists to develop applications on their preferred operating system (usually Linux) and then run them on volunteered Windows and Mac computers. We call these VM apps. BOINC's virtualization support uses a system called VirtualBox, which is open-source software maintained and distributed by Oracle. For more information about VirtualBox, please see their web site at https://www.virtualbox.org/ To run VM apps, your computer must have VirtualBox installed. Starting with BOINC version 7.2.28, the recommended BOINC installer for Windows includes VirtualBox as well. You can also install VirtualBox separately. We recommend the VirtualBox version that is included in the recommended BOINC Windows installer, since newer VirtualBox versions do not work correctly for all projects. If you are running Windows, it is essential that you install a version of BOINC that matches your version of Windows - use 64-bit BOINC on 64-bit versions of Windows, 32-bit BOINC on 32-bit versions of Windows. Otherwise, BOINC will not be able to detect the presence of VirtualBox. Oh! So it's that thing I've been ignoring for several years... #Doh Download BOINC + VirtualBox ID: 102768 · Rating: 0 · rate: / Reply Quote

Tomcat雄猫 Send message Joined: 20 Dec 14 Posts: 180 Credit: 5,389,320 RAC: 0	Message 102781 - Posted: 21 Sep 2021, 9:54:11 UTC - in response to Message 102768. Last modified: 21 Sep 2021, 9:54:22 UTC
aaaf-IDC_pp-FPR_pp-mNHM_pp-NMBEN3_pp_0_1737815_2_1
<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
  <file_name>AIMNet_vm_v2.vdi</file_name>
  <error_code>-119 (md5 checksum failed for file)</error_code>
  <error_message>MD5 check failed</error_message>
</file_xfer_error>
</message>
]]>
There goes another one. I have yet to encounter a Python task that doesn't error out. Is it something wrong with my setup, or is there a problem with these tasks? I've tried the version bundled with BOINC and the latest version, still 100% error rate. ID: 102781 · Rating: 0 · rate: / Reply Quote
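The -119 error above means the downloaded file's MD5 digest did not match the one the client expected. To rule out a local problem (disk, proxy, truncated download), one option is to hash the file yourself and compare. This is only a rough sketch of that check, not how the BOINC client does it internally; the function names are invented here, and where you get the expected checksum from is left up to you.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in chunks.

    Chunked reading matters for something like a multi-gigabyte .vdi
    image, which should not be loaded into memory in one go.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_md5):
    """Compare a file's MD5 with an expected hex string, ignoring case and whitespace."""
    return md5_of_file(path) == expected_md5.strip().lower()
```

If the digest differs between two downloads of the same file, the corruption is probably happening in transit or on your side; if it is the same both times and still fails, the mismatch is more likely on the server side.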

lazyacevw Send message Joined: 18 Mar 20 Posts: 12 Credit: 93,576,463 RAC: 0	Message 102783 - Posted: 21 Sep 2021, 12:05:08 UTC I feel like I am missing out on some new information or something. Do these new "rosetta_python_projects" run native on Linux? Virtualbox is only needed on Windows and Mac, correct? I was under the impression that all Windows BOINC installations were already containers or VMs on Windows or Macs. ID: 102783 · Rating: 0 · rate: / Reply Quote

Falconet Send message Joined: 9 Mar 09 Posts: 354 Credit: 1,579,842 RAC: 1,527	Message 102784 - Posted: 21 Sep 2021, 12:28:43 UTC - in response to Message 102783. I feel like I am missing out on some new information or something. Do these new "rosetta_python_projects" run native on Linux? Virtualbox is only needed on Windows and Mac, correct? I was under the impression that all Windows BOINC installations were already containers or VMs on Windows or Macs. The applications page says: Linux running on an AMD x86_64 or Intel EM64T CPU 1.03 (vbox64) Same as Windows and MacOS. No mention of a native app. ID: 102784 · Rating: 0 · rate: / Reply Quote

dcdc Send message Joined: 3 Nov 05 Posts: 1833 Credit: 123,611,508 RAC: 19,708	Message 102786 - Posted: 21 Sep 2021, 12:54:04 UTC I'm pretty sure it uses VBox regardless of the host OS. I guess that way it can ship with the correct dependencies etc. ID: 102786 · Rating: 0 · rate: / Reply Quote

lazyacevw Send message Joined: 18 Mar 20 Posts: 12 Credit: 93,576,463 RAC: 0	Message 102787 - Posted: 21 Sep 2021, 14:37:28 UTC - in response to Message 102784. Last modified: 21 Sep 2021, 14:41:50 UTC Interesting. I took a look at the applications page https://boinc.bakerlab.org/rosetta/apps.php and the VirtualBox page https://boinc.berkeley.edu/wiki/VirtualBox but I don't see anything that would indicate a definitive answer. My Linux systems do not have VirtualBox installed. The apps page does indicate they are running on Linux systems but they don't indicate if VirtualBox is or needs to be installed. The same goes for Windows in the apps page. It was my understanding that Rosetta tasks were already written for Linux environments and so it was just easier to containerize Linux on Windows than re-write everything for Windows or Macs. I haven't seen anything in my logs about downloading AIMNet_vm_v2.vdi and when I check out my boinc directory, I see minirosetta and 4.20 executables but I don't see anything about vbox64 or python. I have 671 tasks in progress and I am showing 0 rosetta python tasks. I was just curious if there is anything I need to do to keep my systems up to date so that they can maximize processing capability for Rosetta. ID: 102787 · Rating: 0 · rate: / Reply Quote

Falconet Send message Joined: 9 Mar 09 Posts: 354 Credit: 1,579,842 RAC: 1,527	Message 102788 - Posted: 21 Sep 2021, 14:44:07 UTC - in response to Message 102787. If VirtualBox isn't installed, your system won't download any work units or executables related to the vbox app. The "1.03 (vbox64)" entry indicates that the pyrosetta app needs VirtualBox. ID: 102788 · Rating: 0 · rate: / Reply Quote

YAG Send message Joined: 13 Oct 19 Posts: 7 Credit: 13,015,426 RAC: 0	Message 102795 - Posted: 21 Sep 2021, 20:42:57 UTC Hello, why might I be getting the message "Communication with VM Hypervisor failed"? The VBox log shows the error «Failed to create the VirtualBox object! NS_ERROR_SOCKET_FAIL (0xC1F30200) IPC Daemon socket error.» I have BOINC version 7.16.16 and VirtualBox 6.1.26 r145957 running on Debian GNU/Linux 11 (bullseye) [5.10.0-8-amd64|libc 2.31 (Debian GLIBC 2.31-13)]. The boinc user has been added to the vboxusers, vboxguests and libvirt groups. Any clue about what could be happening? I have no experience with VirtualBox; I installed it only because Rosetta@home requires it. ID: 102795 · Rating: 0 · rate: / Reply Quote

Ole Pettersen Send message Joined: 4 Dec 10 Posts: 7 Credit: 12,894,634 RAC: 5,254	Message 102798 - Posted: 22 Sep 2021, 2:53:51 UTC - in response to Message 102795. I got this when I tried to open Oracle VM VirtualBox: ID: 102798 · Rating: 0 · rate: / Reply Quote

dcdc Send message Joined: 3 Nov 05 Posts: 1833 Credit: 123,611,508 RAC: 19,708	Message 102799 - Posted: 22 Sep 2021, 6:40:34 UTC Last modified: 22 Sep 2021, 6:41:06 UTC You could try this app: http://leomoon.com/downloads/application/leomoon-cpu-v/ And there are other tests listed here too: https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=4161 ID: 102799 · Rating: 0 · rate: / Reply Quote

Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0	Message 102806 - Posted: 22 Sep 2021, 13:31:46 UTC - in response to Message 102787. Last modified: 22 Sep 2021, 13:37:56 UTC Interesting. I took a look at the applications page https://boinc.bakerlab.org/rosetta/apps.php and the VirtualBox page https://boinc.berkeley.edu/wiki/VirtualBox but I don't see anything that would indicate a definitive answer. My Linux systems do not have VirtualBox installed. The apps page does indicate they are running on Linux systems but they don't indicate if VirtualBox is or needs to be installed. The same goes for Windows in the apps page. It was my understanding that Rosetta tasks were already written for Linux environments and so it was just easier to containerize Linux on Windows than re-write everything for Windows or Macs. You need VirtualBox for Linux too. There are different versions of Linux, with different libraries. It can cause problems on any project. Even LHC uses VirtualBox on Linux for Theory and ATLAS, though they also have "native" apps for them. But the native apps need a "container" such as singularity, which is even more complicated to set up than just installing VirtualBox. Though it would be nice to have native versions for Rosetta too if they can manage it, since it runs a little more efficiently, and probably needs less memory. ID: 102806 · Rating: 0 · rate: / Reply Quote

Greg_BE Send message Joined: 30 May 06 Posts: 5756 Credit: 6,087,782 RAC: 1,666	Message 102809 - Posted: 22 Sep 2021, 16:27:01 UTC Last modified: 22 Sep 2021, 16:41:27 UTC DEK/ADMIN/dcdc What the heck is going on with your scheduler? Or is there a bug in BOINC? I have now 998 (What the !&#?) tasks sitting in my queue. I lost 11 python tasks because something clogged my system with over 600 4.20 tasks. I am going to abort 900 tasks because there is no way in hell I can plow through that many. That's 332 days of crunching. That's just beyond belief!!! Plus that clogs up my system because your project is limited to a certain number of CPUs while I share the rest with several other projects and all your tasks have 8 hour run times. If I drop that to 4 that would still be over 5 months of work. And all these tasks were due on the 24th? Explain this to me..... BOINC bug or your system bug? ---- Sent back 880 or so tasks...geez...I better go find a limiting command ID: 102809 · Rating: 0 · rate: / Reply Quote
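The backlog figures in the post above are easy to reproduce. The sketch below assumes the numbers are per single core (998 tasks at 8 or 4 hours each, run one after another); the `cores` parameter is an addition for illustration and was not part of the original estimate.

```python
def backlog_days(tasks, hours_per_task, cores=1):
    """Days needed to clear a queue if `cores` tasks run in parallel around the clock."""
    return tasks * hours_per_task / (24 * cores)


print(round(backlog_days(998, 8), 1))  # 332.7 days at the 8 hour run time
print(round(backlog_days(998, 4), 1))  # 166.3 days at 4 hours, roughly 5.5 months
```

With more than one core assigned to Rosetta the totals shrink in proportion, for example `backlog_days(998, 8, cores=8)`, but with a deadline only a couple of days away the queue would still be far too long.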

Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0	Message 102811 - Posted: 22 Sep 2021, 16:41:08 UTC - in response to Message 102809. What the heck is going on with your scheduler? Or is there a bug in BOINC? I have now 998 (What the !&#?) tasks sitting in my queue. I have not had that problem since getting rid of the "max concurrent" in the app_config.xml, as we discussed earlier. ID: 102811 · Rating: 0 · rate: / Reply Quote

Greg_BE Send message Joined: 30 May 06 Posts: 5756 Credit: 6,087,782 RAC: 1,666	Message 102812 - Posted: 22 Sep 2021, 17:20:46 UTC - in response to Message 102811. Last modified: 22 Sep 2021, 17:37:14 UTC What the heck is going on with your scheduler? Or is there a bug in BOINC? I have now 998 (What the !&#?) tasks sitting in my queue. I have not had that problem since getting rid of the "max concurrent" in the app_config.xml, as we discussed earlier. I have to have max concurrent in order to limit the number of CPUs RAH uses. Otherwise my idea of splitting up my system so every project has its own group of cores is out the door, and then I run into the problem of every project dominating my system, where some get all the work for days on end and others don't. If RAH would do like LHC and allow ME to pick how many cores to give it, then I would not have to do max concurrent. I need to find a command to limit the number of files downloaded. OR would project_max_concurrent be any better than max_concurrent? How about ncpus and take out max_concurrent? ID: 102812 · Rating: 0 · rate: / Reply Quote
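For readers who have not seen the file being discussed, here is a rough sketch of what an app_config.xml with `max_concurrent` and `project_max_concurrent` looks like, generated from Python so the structure is explicit. The app name "rosetta" and the limits are assumptions for illustration ("rosetta_python_projects" is the app name quoted earlier in the thread); use the names your own client reports. Note that, going by the replies in this thread, these settings cap how many tasks run at once, not how many get downloaded.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_limits, project_limit=None):
    """Return app_config.xml text with one <app> block per application.

    app_limits maps an app name to its max_concurrent value;
    project_limit, if given, becomes <project_max_concurrent>.
    """
    root = ET.Element("app_config")
    for app_name, limit in app_limits.items():
        app = ET.SubElement(root, "app")
        ET.SubElement(app, "name").text = app_name
        ET.SubElement(app, "max_concurrent").text = str(limit)
    if project_limit is not None:
        ET.SubElement(root, "project_max_concurrent").text = str(project_limit)
    ET.indent(root)  # pretty-printing; needs Python 3.9 or newer
    return ET.tostring(root, encoding="unicode")


# Hypothetical limits: 4 regular Rosetta tasks, 1 VM task, 4 in total.
print(build_app_config({"rosetta": 4, "rosetta_python_projects": 1}, project_limit=4))
```

The resulting text would go into app_config.xml in the project's folder under the BOINC data directory, followed by a re-read of the config files from the Manager.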

Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0	Message 102813 - Posted: 22 Sep 2021, 17:51:13 UTC - in response to Message 102812. OR would project_max_concurrent be any better than max_concurrent? How about ncpus and take out max_concurrent? Unfortunately, project_max_concurrent won't work either. https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5720&postid=45323#45323 You could try ncpus, but I think that is mainly for multi-threaded apps. What I do is just create separate BOINC instances for each type of work unit when I need to limit them. Then you can just set "use at most % of the processors" to limit it to what you want. It is a bit of a pain, but actually simple enough once you do it the first time. This more or less gives you all you need. https://www.overclock.net/threads/guide-setting-up-multiple-boinc-instances.1628924/ I think that on Windows, they neglect to tell you how to start it up automatically, though you can do it manually easily enough. I used Task Scheduler to start the BOINC client automatically, but don't have the details since I usually use Ubuntu, where it is a bit simpler. I think you have to start up BOINC Manager separately in the "Programs/Startup" folder, but that was on Win7 and I am now on Win10. ID: 102813 · Rating: 0 · rate: / Reply Quote
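As a rough illustration of the multiple-instance approach described above, the sketch below builds the command line for a second BOINC client with its own data directory and GUI RPC port. It only prints the command. The directory and port are made-up examples, the executable name differs by platform (boinc.exe on Windows), and the linked guide remains the thing to follow for the full setup.

```python
def second_client_command(data_dir, rpc_port, executable="boinc"):
    """Build the argument list for an extra BOINC client instance.

    --allow_multiple_clients lets a second client run on the same host,
    --dir points it at a separate data directory, and --gui_rpc_port
    gives it a port that does not clash with the first client's.
    """
    return [
        executable,
        "--allow_multiple_clients",
        "--dir", data_dir,
        "--gui_rpc_port", str(rpc_port),
    ]


# Hypothetical directory and port, chosen only for the example.
print(" ".join(second_client_command("/var/lib/boinc-rosetta", 31418)))
```

Each instance then gets its own "use at most % of the processors" preference, which is what makes the per-project core split possible.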