Message boards : Number crunching : Problems and Technical Issues with Rosetta@home
Previous · 1 . . . 203 · 204 · 205 · 206 · 207 · 208 · 209 . . . 300 · Next
Author | Message |
---|---|
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
Sid, don't you use a disk cleaner? I have never had your issue. After that 0 value error thing and before the guys at github figured it out, I dropped some cash on a new SDD to put the data on. You might consider that if your budget allows it. Then you split program files to c: and data to d: (or whatever). Then data can have all the space it needs and only BOINC program files reside on c: On my c: I have only 91 free from 220 (only boinc program files are here) but on d: I have 419 out of 450 freeand the data from 6 projects uses only 15.2. Rosetta is 8.63 of this total. 5,357 files and 424 folders. You might not need 450GB and maybe something like a 250GB drive would work and just move data to there, then no matter what it wants, there is always lots of room. It's and idea, not sure if your bank account would like that idea or not. |
Paddles Send message Joined: 15 Mar 15 Posts: 11 Credit: 5,337,832 RAC: 2,215 |
I'm encountering a new problem, has anyone else see it?. I have three Python tasks in the state "Postponed: VM Hypervisor failed to enter an online state in a timely fashion." I'm running BOINC 7.6.20 and was on VirtualBox 6.1.32 - combination had been working generally happy, and I haven't changed any BOINC settings recently (or had any significant changes to available disk space). I've updated VBox to 6.1.34 to see if that resolves it, but it looks like the Python tasks are being postponed for a day and none of the others have started. |
kotenok2000 Send message Joined: 22 Feb 11 Posts: 258 Credit: 483,503 RAC: 109 |
Can you unupdate to 5.2.44? |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2117 Credit: 41,162,648 RAC: 14,990 |
And it's weird you have so many .tmp files. I thought BOINC deleted all files once the data was uploaded and reported? Ok, I've just used Windows Disk cleanup and ensured storage sense is enabled and freed up a few Gb, but that's on a PC that isn't running VBox I'll give that a go when I get back to my main PC tomorrow evening |
Paddles Send message Joined: 15 Mar 15 Posts: 11 Credit: 5,337,832 RAC: 2,215 |
Can you unupdate to 5.2.44? I'll give that a try if there are continued problems. I just went back and looked, the existing tasks are still postponed (1 day hasn't elapsed yet, and there isn't an obvious way to manually resume them), but one of the other python tasks seems to be running ok so maybe it was just a transient issue. |
Paddles Send message Joined: 15 Mar 15 Posts: 11 Credit: 5,337,832 RAC: 2,215 |
Update: The first task to be postponed reached the end of its one day postponement, and now appears to be computing successfully (in VBox 6.1.34). Haven't tried reverting to previous version to see what happens, but whatever the problem was it seems to have resolved. |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 2,123 |
Can you unupdate to 5.2.44? I've found a way to manually resume such tasks. Suspend all tasks, then exit BOINC. I've forgotten if it's necessary to restart Windows at this point. Then restart BOINC and tell it to resume tasks. , |
Mr P Hucker Send message Joined: 12 Aug 06 Posts: 1600 Credit: 11,717,270 RAC: 10,117 |
Ok, I've just used Windows Disk cleanup and ensured storage sense is enabled and freed up a few Gb, but that's on a PC that isn't running VBoxI run the Windows disk cleanup (including system files) then run treesize which shows me what folders are using the most, so I can manually remove stuff I don't want anymore. Last time I reduced the stuff on my disk by a third. I only bother doing this when the line changes from blue to red in windows explorer. |
Jean-David Beyer Send message Joined: 2 Nov 05 Posts: 187 Credit: 6,380,354 RAC: 5,710 |
I run Linux and have never run out of disk space (because spinning hard drives are now so big and so cheap). I have about 512 GBytes of ssd, and two 4-Terabyte spinning hard drives. But on Linux, you can find out how your disk space is being used very easily. Here is what is in my /var/lib /boinc directory and everything under it. To keep from boring you, I printed out only the first 24 lines. The numbers are in 1024-byte blocks. Right now, I have only universe and rosetta tasks running on my machine. So I seem to be using about 2.37 GigaBytes of disk space in that partition that is sized at about 500 GigaBytes of size. When I have a lot of ClimatgeaPrediction tasks and WCG tasks, I use a lot more, but even then, I come nowhere close to using it all. [/var/lib/boinc]$ du . | sort -nr | head -n 24 2373204 . 2282044 ./projects 1763172 ./projects/boinc.bakerlab.org_rosetta 996448 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database 996448 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl 454540 ./projects/www.worldcommunitygrid.org 310248 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/chemical 273928 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/chemical/pdb_components 243200 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/sampling 236248 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/scoring 191412 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/scoring/score_functions 190452 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/rotamer 91416 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/rotamer/ncaa_rotlibs 86112 ./slots 84812 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/rotamer/ncaa_rotlibs/ncaa_rotamer_libraries 58676 ./projects/climateprediction.net 53652 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/scoring/score_functions/rama 51688 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/scoring/score_functions/mhc_epitope 45804 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/rotamer/ncaa_rotlibs/ncaa_rotamer_libraries/n_methyl_amino_acid 39948 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/scoring/score_functions/P_AA_pp 39672 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/scoring/score_functions/P_AA_pp/shapovalov 37292 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/scoring/score_functions/P_AA_pp/shapovalov/2.5deg 34520 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/chemical/residue_type_sets 32532 ./projects/boinc.bakerlab.org_rosetta/database_357d5d93529_n_methyl/minirosetta_database/scoring/motif As far as .tmp files are concerned, there are very few: here are all of them: [/var/lib/boinc]$ du -a | grep tmp 484 ./slots/0/data0.tmp 0 ./slots/0/data1.tmp 0 ./slots/0/data2.tmp 12 ./slots/0/error.tmp 228 ./slots/1/data0.tmp 0 ./slots/1/data1.tmp 0 ./slots/1/data2.tmp 8 ./slots/1/error.tmp 4 ./slots/2/rosetta_tmp.txt 512 ./slots/3/data0.tmp 0 ./slots/3/data1.tmp 0 ./slots/3/data2.tmp 12 ./slots/3/error.tmp 4 ./slots/4/rosetta_tmp.txt 4 ./slots/5/rosetta_tmp.txt 4 ./slots/6/rosetta_tmp.txt 4 ./slots/7/rosetta_tmp.txt |
Mr P Hucker Send message Joined: 12 Aug 06 Posts: 1600 Credit: 11,717,270 RAC: 10,117 |
I run Linux and have never run out of disk space (because spinning hard drives are now so big and so cheap).You forgot "and slow". I've banned spinning disks from anything boinc related in my house. I have 7 PCs running Boinc and it's difficult to control them all when one is sat waiting on a disk! The only things rust spinners are used for is backups, TV/Film storage, and security cameras. |
.clair. Send message Joined: 2 Jan 07 Posts: 274 Credit: 26,399,595 RAC: 0 |
Is it time to panic ? There are less than a million tasks left on the front page . . . Does this mean we may run out of pythons sometime this year :-) and then what will we do for `entertainment` |
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
Is it time to panic ? I am wondering that myself. The pythons are from a single researcher, and I don't know if there will be more. Maybe it is just a one-shot experiment? Since they never tell us anything, planning is not possible. |
Mr P Hucker Send message Joined: 12 Aug 06 Posts: 1600 Credit: 11,717,270 RAC: 10,117 |
Is it time to panic ?Play with WCG. If they ever work out how to move a server from one building to another. Another delay until 9th May.... |
Grant (SSSF) Send message Joined: 28 Mar 20 Posts: 1673 Credit: 17,646,045 RAC: 22,562 |
Is it time to panic ?Given that the most In progress for them was a bit over 21,000, they tend to average around 15,000 or less, and that there are presently only 10,500 In progress, i think it will be a long, long, long time before they get cleared due to the very minuscule number of systems that are actually processing them. Grant Darwin NT |
Mr P Hucker Send message Joined: 12 Aug 06 Posts: 1600 Credit: 11,717,270 RAC: 10,117 |
I make that four months. Depends how soon you want to panic. Anyway why panic when there's about 50 projects to play with? I'm off doing Milkyway (DP cards), Cosmology (CPUs), and Folding (SP cards) just now.Is it time to panic ?Given that the most In progress for them was a bit over 21,000, they tend to average around 15,000 or less, and that there are presently only 10,500 In progress, i think it will be a long, long, long time before they get cleared due to the very minuscule number of systems that are actually processing them. |
Greg_BE Send message Joined: 30 May 06 Posts: 5691 Credit: 5,859,226 RAC: 0 |
I make that four months. Depends how soon you want to panic.Is it time to panic ?Given that the most In progress for them was a bit over 21,000, they tend to average around 15,000 or less, and that there are presently only 10,500 In progress, i think it will be a long, long, long time before they get cleared due to the very minuscule number of systems that are actually processing them. And then we get to have fun with the buggy stuff. |
Mr P Hucker Send message Joined: 12 Aug 06 Posts: 1600 Credit: 11,717,270 RAC: 10,117 |
And then we get to have fun with the buggy stuff.I prefer dune buggies. |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 2,123 |
I make that four months. Depends how soon you want to panic.Is it time to panic ?Given that the most In progress for them was a bit over 21,000, they tend to average around 15,000 or less, and that there are presently only 10,500 In progress, i think it will be a long, long, long time before they get cleared due to the very minuscule number of systems that are actually processing them. I prefer both to the situation at Predictor@Home. They lost the two members of their project team who knew how the create useful new workunits (probably because they graduated). For several months, they kept the project running by repeatedly raising the number of times a workunit could fail before no more tasks would be sent out for it. Some of the remaining workunits failed over 30 times before the professor in charge decided it was not worthwhile to let the project continue, and it shut down. |
Jim1348 Send message Joined: 19 Jan 06 Posts: 881 Credit: 52,257,545 RAC: 0 |
And then we get to have fun with the buggy stuff. This IS the buggy stuff. That is one reason I am concerned we may not get more. They did not bother to fix it, so it may be good enough for what they need it for. |
Mr P Hucker Send message Joined: 12 Aug 06 Posts: 1600 Credit: 11,717,270 RAC: 10,117 |
I prefer both to the situation at Predictor@Home. They lost the two members of their project team who knew how the create useful new workunits (probably because they graduated). For several months, they kept the project running by repeatedly raising the number of times a workunit could fail before no more tasks would be sent out for it. Some of the remaining workunits failed over 30 times before the professor in charge decided it was not worthwhile to let the project continue, and it shut down.ROFL, Wikipedia says "Though it was quite successful, a "disagreement" between the project administration and the user base caused a mass exodus of participating users" |
Message boards :
Number crunching :
Problems and Technical Issues with Rosetta@home
©2024 University of Washington
https://www.bakerlab.org