Message boards : Number crunching : Problems and Technical Issues with Rosetta@home
Previous · 1 . . . 138 · 139 · 140 · 141 · 142 · 143 · 144 . . . 300 · Next
Author | Message |
---|---|
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
no more work ? really? I am continuing to get work, though most of it is _1. And the new ones seem to be taking longer, or at least the estimates are. It may just be how my machines are set up. I am doing more work units now, but they may be cache-limited. |
.clair. Send message Joined: 2 Jan 07 Posts: 274 Credit: 26,399,595 RAC: 0 |
no more work ? really? You may be getting the ones that I `aborted` . `errored` . `crashed` wimin drivers . . . . :) Nnnnnn,, |
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
Abort some more. They are really out of pythons now, though I did pick up a few of the regular Rosettas. But even they seem to be out now. On some of my machines, I can make it until tomorrow. On others, I can't. |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
Abort some more. They are really out of pythons now, though I did pick up a few of the regular Rosettas. Hurry up...3 left |
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
Hurry up...3 leftYes, you have to get them when you can. But I pick up a few more from time to time, so I should make it until tomorrow. Hopefully they will throw some more in the hopper. |
nwayno Send message Joined: 28 May 20 Posts: 6 Credit: 7,006,260 RAC: 29 |
Yes there has been no work units for several weeks. I switched to World Community Grid. My raspberry pi's have nothing to do, so I am powering those off. It would certainly help, as you said something like: Yeah, it's broke, we're working on it. I will check in again after the first of the year as well. |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1673 Credit: 17,601,972 RAC: 22,407 |
Yes there has been no work units for several weeks.Not true. There have been periods over the last 3 weeks where there have been no new Rosetta 4.20 Tasks available from the project- there were 4 days with no new work from Nov 11th, after that it was generally 1-2 days between spurts of new Rosetta 4.20 work. Along with the occasional batch of RB Tasks being sent out as well. But it has been just the last 36 hours or so where there has been no new Python work available either, just the very occasional resend. Grant Darwin NT |
Jean-David Beyer Send message Joined: 2 Nov 05 Posts: 187 Credit: 6,370,872 RAC: 5,700 |
Yes, you have to get them when you can. But I pick up a few more from time to time, so I should make it until tomorrow. I see there are lots of them in the hopper, Rosetta 3674 63066 6.88 (0.24 - 56.39) 2050 and my machine asks for some, but does not get any. Mon 29 Nov 2021 08:40:33 AM EST | Rosetta@home | update requested by user Mon 29 Nov 2021 08:40:38 AM EST | Rosetta@home | Sending scheduler request: Requested by user. Mon 29 Nov 2021 08:40:38 AM EST | Rosetta@home | Requesting new tasks for CPU Mon 29 Nov 2021 08:40:40 AM EST | Rosetta@home | Scheduler request completed: got 0 new tasks Mon 29 Nov 2021 08:40:40 AM EST | Rosetta@home | No tasks sent Mon 29 Nov 2021 08:40:40 AM EST | Rosetta@home | Project requested delay of 31 seconds Mon 29 Nov 2021 08:41:15 AM EST | Rosetta@home | Sending scheduler request: To fetch work. Mon 29 Nov 2021 08:41:15 AM EST | Rosetta@home | Requesting new tasks for CPU Mon 29 Nov 2021 08:41:16 AM EST | Rosetta@home | Scheduler request completed: got 0 new tasks Mon 29 Nov 2021 08:41:16 AM EST | Rosetta@home | No tasks sent |
Peter Humphrey Send message Joined: 26 Jul 18 Posts: 5 Credit: 4,256,666 RAC: 4,788 |
In that case I may decide I can't afford to stay with this project. It's far too much of a memory hog; I've suspended it while I debate with myself. Who'd have though that 64GB RAM would be too little, even with 24 processors? |
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
Who'd have though that 64GB RAM would be too little, even with 24 processors? If you are willing to jump through some hoops (though they are actually rather easy), there is a way, by running multiple BOINC instances. https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=103516#103516 You then run as many as you can (8 for example) in each instance. That works because you only need a lot of memory to download them, not run them. Here is how to set it up: https://www.overclock.net/threads/guide-setting-up-multiple-boinc-instances.1628924/ I already had a second BOINC instance set up on a Ryzen 3900X with 96 GB of memory, so I can use all 24 cores (12 per instance). Also, I added a second BOINC instance to a Ryzen 3950X with 128 GB of memory. They are all under Ubuntu 20.04.3, but it works on Windows as well. It is just a bit easier to start up automatically in Linux. You can use three BOINC instances (or more) if you need to. |
Jean-David Beyer Send message Joined: 2 Nov 05 Posts: 187 Credit: 6,370,872 RAC: 5,700 |
I messed around with app_config but that can make a mess of things. I am using this, and that sets my upper bound to three at a time. What would be the symptoms of the mess to which you refer? [/var/lib/boinc/projects/boinc.bakerlab.org_rosetta]$ cat app_config.xml <app_config> <project_max_concurrent>3</project_max_concurrent> </app_config> |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
I messed around with app_config but that can make a mess of things. project? I have never seen that before. Most of us tried to use max_concurrent and then got buried in tons of tasks we could never complete by their deadlines. |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
Something does not make sense Says 2,000 tasks queued. Had a look at the schedulers...0 on all projects. So are the 2,000 not released yet or have they all been taken and the system did not update? |
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
project? I have never seen that before. Project_max_current will limit the total number of work units running for all projects. But either one of them can cause the problem of excessive downloads. https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5720&postid=45319 https://lhcathome.cern.ch/lhcathome/forum_thread.php?id=5720&postid=45323 |
Falconet Send message Joined: 9 Mar 09 Posts: 353 Credit: 1,227,479 RAC: 3,710 |
The queue only updates once around every 4 hours while the server status page is around 30 minutes or so. |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
project? I have never seen that before. So it comes down to this, any attempt to limit the amount of tasks will cause excessive downloads. What if you rolled back in versions of BOINC? |
Bryn Mawr Send message Joined: 26 Dec 18 Posts: 389 Credit: 12,073,013 RAC: 8,289 |
project? I have never seen that before. Not will, can. I’ve been running project_max_concurrent on most projects for several years with no excess downloads. |
Peter Humphrey Send message Joined: 26 Jul 18 Posts: 5 Credit: 4,256,666 RAC: 4,788 |
You then run as many as you can (8 for example) in each instance. That works because you only need a lot of memory to download them, not run them. Here is how to set it up: I don't understand. When I run boincmgr it shows several jobs as "waiting for memory". How can adding yet more of them release memory? And why would boinc need more memory to download a job than to run it? (This is Gentoo Linux.) |
Falconet Send message Joined: 9 Mar 09 Posts: 353 Credit: 1,227,479 RAC: 3,710 |
You then run as many as you can (8 for example) in each instance. That works because you only need a lot of memory to download them, not run them. Here is how to set it up: Read my post. I hope it helps. https://boinc.bakerlab.org/rosetta/forum_thread.php?id=6893&postid=103572 |
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
Falconet has the right answer. But I would only add that it is the project that sets the memory requirements, not BOINC. If they say more, then BOINC just obeys. (The memory isn't released, it is just reserved.) And adding a second BOINC instance gives you another bite at the apple. One BOINC instance doesn't see what the other one is doing. So if the pythons ever do require more memory to run, that could cause problems. But we are a long way from that at the moment. |
Message boards :
Number crunching :
Problems and Technical Issues with Rosetta@home
©2024 University of Washington
https://www.bakerlab.org