Message boards : Number crunching : Problems and Technical Issues with Rosetta@home
Previous · 1 . . . 63 · 64 · 65 · 66 · 67 · 68 · 69 . . . 300 · Next
Author | Message |
---|---|
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 3,846 |
Given that the number of very large RAM requirement Tasks is only a small percentage of the total number of Tasks available, allowing 1.3GB per Task means you will usually have more than enough room for the OS. For Rosetta@home, the large tasks are usually due to large proteins. Although, I've seen a message saying that doing anything for COVID-19 under Rosetta@home also causes large tasks. Rosetta@home hasn't said much if anything about whether the large protein work comes in batches. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2115 Credit: 41,115,238 RAC: 19,699 |
yeah. It's identical to BOINC, but the WCG team does thorough testing before updating their version of the BOINC software. So in theory, less likely to run into bugs. Thinking about it again, I might have seen a notice mentioning a WCG version recently, but I thought it was just the usual Boinc Manager but with a WCG badge stuck on it. The idea that IBM might also have done a security check on it makes me think I should use that version as I have very little faith in the standard Boinc Manager. But maybe updates will be delayed too. I'll stay as I am and just try to pay more attention to WCG notices to see how it works after version changes. Thanks for pointing it out anyway. |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 3,846 |
yeah. It's identical to BOINC, but the WCG team does thorough testing before updating their version of the BOINC software. So in theory, less likely to run into bugs. Watch for signs that the WCG BOINC is only available as a 32-bit version, and is therefore unable to run any tasks using a program compiled to run under 64-bit Windows. I've seen nothing definite about whether it is, so I'll only call it something to watch for. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2115 Credit: 41,115,238 RAC: 19,699 |
WCG is a separate BOINC project. Unrelated to Rosetta@home. Thanks. If it's 32-bit I think I'll stick with the standard version |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2115 Credit: 41,115,238 RAC: 19,699 |
yeah. It's identical to BOINC, but the WCG team does thorough testing before updating their version of the BOINC software. So in theory, less likely to run into bugs. Seen that now. Thanks for confirming. I'll stick with what I'm used to. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2115 Credit: 41,115,238 RAC: 19,699 |
In short, no. Sounds very odd. When I first saw your msg for some reason none of the other very good replies and suggestions were showing, nor were your useful images, so I've only just caught up. If I can make some further suggestions: Suspend when computer is on battery: Untick. Your computer doesn't run on battery so the selection is redundant. It's intended for laptops/portables that aren't plugged into the mains. Suspend GPU computing when computer is in use: Untick. None of the projects you run offer GPU computing. And clear the field that mentions mouse/keyboard input - also redundant. Suspend when Non-Boinc CPU usage is above: Untick and clear the %age. This requires some further explanation since you mention other programs you run. You may not have picked up on it, but Grant mentioned that Rosetta (and I think tasks from all projects under Boinc) run at 'Low' or 'Idle' priority. This was a concept I didn't know existed until I started using Boinc. If you open the task manager under Windows 10, go to the "Details" column, find a Rosetta task, right-click on it and hover the mouse over the "Set Priority" option, a little sub-window will open showing the priority Rosetta runs at. On mine it shows 'Low'. Do the same on any other program you can identify and it'll show 'Normal' - a higher priority than 'Low'. That means, when you type, move the mouse, play music, watch video, or run any other program on your PC, they will use the CPU ahead of Rosetta - or Rosetta defers priority to anything else you ask your PC to do. Sometimes circumstances can arise when there's a conflict between Rosetta and other programs you run, but it's generally something pretty specialist. Recently, when work was hard to come by here, the only way I could tell I'd run out of Rosetta tasks was that the fans went quieter and the room became less hot. Certainly not for responsiveness of my PC. But if an issue arises with that program you mentioned, come back again because I think Grant had a way around it. In the usage limits section: Use at most 75% of CPU time: Unless you have issues with temperatures, using anything other than 100% can lead to task errors, particularly if tasks are starting & stopping a lot. Over-high temperatures are the only reason I'd think of reducing from 100%. But see below too. Use at most 75% of CPUs: I understand why you've cut down to 3-cores rather than 4 if it's to do with the limited RAM you have - I misunderstood this before. While I have more RAM on my i3-8350, I've also got my memory allocation set to 65% in use and 85% not in use. So I'm wondering if you should try bumping your RAM allocation up to 60% and 80% from 50% & 75% and then seeing if that allows you to run the 4th core as well. If you return to your original problem of tasks refusing to run then by all means return to 75% of CPUs, but it's worth a fresh try after changing your RAM limits imo. And note that running all 4-cores at 100%, 100% of the time, with nothing set in the "when to suspend" section, that will push up your temperatures as well, so there's two reasons you may want to go back to 75% cores if it's problematic. But even if you do drop back on cores, the other changes are worth doing. |
Stevie G Send message Joined: 15 Dec 18 Posts: 107 Credit: 822,669 RAC: 1,625 |
Mikey: Thanks for the response. I uninstalled WCG, downloaded BOINC again, reinstalled the BOINC Manager and all the problems were solved. Got my projects back, but still no Rosetta tasks. I think you may be right, that WCG may have been installed with BAM. It was making my computer run r-e-a-l-l-y slowly. It changed my Asteroids completion time from around 3 hours to over 4:30 hours. So I added WCG in the BOINC Manager and it downloaded one task. We'll see how that works. My machine is back up to speed now and waiting for more Rosetta. Maybe you guys were correct in saying this box is too low-spec for that kind of work. Steven Gaber Oldsmar, FL |
EHM-1 Send message Joined: 21 Mar 20 Posts: 23 Credit: 183,782 RAC: 0 |
When I first saw your msg for some reason none of the other very good replies and suggestions were showing, nor were your useful images, so I've only just caught up. @Sid: Very good of you to go through all that so thoroughly for me! I've implemented most of your suggestions. And thanks for the tip about where application priority is assigned in Windows -- never been in that corner of Task Manager before. I do now suspect that it was expired work units that may have hung up Rosetta. Since it fetched new ones and resumed processing yesterday, I'm now waiting to see when BOINC will fetch new work units for WCG, which it has yet to do since it finished the batch it processed yesterday. Today I've been messing with various commands to probe how that all works. Funny to think that after almost 20 years of running BOINC, I've learned more about it in the last two days than in the entire preceding time. Thanks again! Eric system: up-to-date Windows 10, Intel quad-core 3.6 GHz processor, 8 GB RAM |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,118,186 RAC: 6,004 |
Mikey: I'm glad it worked for you!!! Now it's depends on your resource share settings for each project on how often you get tasks for each project, the higher the rescourse share the more tasks per day you will run, as long as they are available. Think of the maximum resource share as percentage parts of 100 and each project gets a share. What's easiest is to set Rosetta at say 50%, WCG at 25% and some orther project at 25% andlet Boinc figure it out,which it will do over time.Just besure to keep your cache sizes small so you don't run into deadline problems. With Rosetta's 3 day deadline if you have 3 days of work NO other projects will crunch because their deadline will be further out than 3 days. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1671 Credit: 17,526,840 RAC: 23,319 |
Anyway, restricting the number of cores can't help with memory, Boinc will always use as many cores as possible until it hits your set memory limit.No it won't. If you limit it to 1 core, even if you have 128 of them, it will only use the one core for BOINC work. I limited the number of cores to use on one of my systems for a while to avoid out of memory errors until i upgraded the RAM. Grant Darwin NT |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2115 Credit: 41,115,238 RAC: 19,699 |
@Sid: Very good of you to go through all that so thoroughly for me! I've implemented most of your suggestions. And thanks for the tip about where application priority is assigned in Windows -- never been in that corner of Task Manager before. Neither had I until Boinc came along for me. I don't think I had any other reason to consider it for anything else I've ever done on my computer. I do now suspect that it was expired work units that may have hung up Rosetta. Since it fetched new ones and resumed processing yesterday, I'm now waiting to see when BOINC will fetch new work units for WCG, which it has yet to do since it finished the batch it processed yesterday. Today I've been messing with various commands to probe how that all works. Funny to think that after almost 20 years of running BOINC, I've learned more about it in the last two days than in the entire preceding time. Thanks again! I don't think it was expired work units. I suspect it was all down to the number of tasks you were aiming to run within limited RAM. Which is why allocating a bit more RAM - or - reducing the tasks running at the same time (thereby needing less RAM) is giving you the room to run successfully. And unnecessarily suspending tasks when they could run is the rest of it. And all WCG projects are way less demanding of RAM so there's never going to be a problem with them. I'll be interested to see what happens after you give it a few days. Hopefully all issues will be solved. |
Mr P Hucker Send message Joined: 12 Aug 06 Posts: 1600 Credit: 11,716,372 RAC: 18,198 |
The idea that IBM might also have done a security check on it makes me think I should use that version as I have very little faith in the standard Boinc Manager. What security problem could Boinc possibly have? You are in control of what projects it communicates with. Suspend when computer is on battery: Untick. Your computer doesn't run on battery so the selection is redundant. It's intended for laptops/portables that aren't plugged into the mains. I use that option on my desktop as it has a UPS. Suspend GPU computing when computer is in use: Untick. None of the projects you run offer GPU computing. And clear the field that mentions mouse/keyboard input - also redundant. Does it matter what it's set to if it's redundant? You may not have picked up on it, but Grant mentioned that Rosetta (and I think tasks from all projects under Boinc) run at 'Low' or 'Idle' priority. Except LHC which is in a virtual machine and doesn't seem to behave properly. If you open the task manager under Windows 10, go to the "Details" column, find a Rosetta task, right-click on it and hover the mouse over the "Set Priority" option, a little sub-window will open showing the priority Rosetta runs at. Actually priorities in Windows 10 are abysmal. I've often seen things at low priority getting more CPU than things at normal. Sometimes circumstances can arise when there's a conflict between Rosetta and other programs you run, but it's generally something pretty specialist. I've got Boinctasks on the monitor to my right, I can see what all 6 machines are doing all the time. Correction, it's spilled into two monitors now. 76 tasks at once. In the usage limits section: I use TThrottle to do a similar thing and it doesn't seem to cause problems. Except TThrottle is better, as you set a temperature and it adjusts the % continuously. I have it on the machine in here because it's too loud at full blast if the room is warm. And note that running all 4-cores at 100%, 100% of the time, with nothing set in the "when to suspend" section, that will push up your temperatures as well, so there's two reasons you may want to go back to 75% cores if it's problematic. Apart from laptops, I've never known a CPU overheat, even on stock fans. Just besure to keep your cache sizes small so you don't run into deadline problems. With Rosetta's 3 day deadline if you have 3 days of work NO other projects will crunch because their deadline will be further out than 3 days. Actually the others will crunch, as Boinc downloads small amounts of work each time, not the whole 3 days, it tops it up. If Rosetta has been using the CPU all of the time, the next call for work will be from another project. No it won't. If you limit it to 1 core, even if you have 128 of them, it will only use the one core for BOINC work. I didn't mean that, I meant if you do not restrict cores, then Boinc will restrict for you when your RAM runs out. |
Christopher Graesser Send message Joined: 26 Jan 16 Posts: 3 Credit: 1,192,390 RAC: 0 |
In the meantime, my BOINC Client works fine again, seems to be a temporary issue. All other projects worked fine, the rosetta website responded slowly, but after quite some time, it worked well again. Thanks to all. |
EHM-1 Send message Joined: 21 Mar 20 Posts: 23 Credit: 183,782 RAC: 0 |
Well, the adventure continues, with a new question about BOINC's work unit triage decisions. In the below task listing, can anyone explain this scenario, referring to the screenshot below? Note: With my current settings, BOINC processes three work units at a time.
system: up-to-date Windows 10, Intel quad-core 3.6 GHz processor, 8 GB RAM |
Mr P Hucker Send message Joined: 12 Aug 06 Posts: 1600 Credit: 11,716,372 RAC: 18,198 |
Since the Rosetta one (red arrows) has been waiting its turn, and has the earliest deadline in the list, I expect BOINC will start on it next. It doesn't do it that way, or Rosetta would hog your machine for a week as the other projects all have longer deadlines. It picks the next task according to your project weights - eg. if you have Rosetta and WCG on even weights, then it will try to do the same amount of each overall (averaged over some days - not sure how many). |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 3,846 |
Well, the adventure continues, with a new question about BOINC's work unit triage decisions. In the below task listing, can anyone explain this scenario, referring to the screenshot below? [snip] BOINC does NOT automatically choose to start the task with the nearest deadline first. It's more like: High priority (usually due to a deadline in less than 24 hours) tasks first. Projects in order by how much work is needed to restore the balance, skipping those that have no tasks ready to start. Within each project, usually from the oldest to the newest download time. If multiple tasks are marked with the same download time, something I have not identified. I suspect that is in the order of which the last of the task's input files were downloaded. You don't have to like this, but complaining on Rosetta@home is unlikely to change this. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1671 Credit: 17,526,840 RAC: 23,319 |
Why the WCG and not the Rosetta? I can only guess it has to do with the "switch between tasks" setting, which I currently have set to a little longer than a Rosetta work unit requires.No, it's not to do with the Switch between Taks setting- it is all about your Resource share settings. As BOINC does work for each project, it keeps track of the mount of work done. It will then balance what is done between the different projects, in order to meet your Resource share settings. When you add of remove projects, change cache settings, increase (and even more so decrease) the amount of time BOINC can process work it then has to re-juggle the work it does to match the work done on projects with the debt owed to other projects. It takes time for things to settle down. If you keep tweaking things, then they will never settle down. Since Rosetta units require a longer processing time and carry tighter deadlines than the WCG OpenPandemics, I would think it best for BOINC to be toiling on two Rosettas and one WCG at a time, thereby completing 2 of the former and 5 of the latter every 8 hours. But I don't see how to tweak BOINC to achieve that if it is in fact a valid pursuit. Any ideas?Leave it alone and let it sort itself out. The larger your cache, the more projects you run, the less time your projects have to actually do work, the longer it will take for things to settle down (think months). If BOINC is able to run whenever the system is running, and the system is running, and you run with no cache, then things will settle down within a week or 2. NB- Use at most xx% of CPU time is best set at 100%. Reduce the number of cores in use (Use at most xx% of the CPUs) if heat is an issue (or improve the system cooling). Suspend when non-BOINC CPU usage is above xx% is best not being selected at all. If BOINC processing does affect another programme, you can use the Exclusive Application option to stop BOINC when that particular programme is running. Grant Darwin NT |
EHM-1 Send message Joined: 21 Mar 20 Posts: 23 Credit: 183,782 RAC: 0 |
Thanks for that input, guys! I thought it might be a case of settling in to a routine. I've tweaked the settings according to Grant's suggestions, and will now sit back for a stretch and see how things shake out. Eric system: up-to-date Windows 10, Intel quad-core 3.6 GHz processor, 8 GB RAM |
mikey Send message Joined: 5 Jan 06 Posts: 1895 Credit: 9,118,186 RAC: 6,004 |
Thanks for that input, guys! I thought it might be a case of settling in to a routine. I've tweaked the settings according to Grant's suggestions, and will now sit back for a stretch and see how things shake out. It also had to do with the fact that the 2nd of July is still 3 days away!!! Boinc thinks your Rosetta task will take 8 hours of which there are still 7 of those between right now and the deadline on the 2nd off July, plenty of time to do a single workunit. |
EHM-1 Send message Joined: 21 Mar 20 Posts: 23 Credit: 183,782 RAC: 0 |
Follow-up to my previous post, just out of curiosity: Why would BOINC interrupt a work unit in progress to start another? I promise I'll hold off on questions now... Eric system: up-to-date Windows 10, Intel quad-core 3.6 GHz processor, 8 GB RAM |
Message boards :
Number crunching :
Problems and Technical Issues with Rosetta@home
©2024 University of Washington
https://www.bakerlab.org