Message boards : Number crunching : Rosetta 4.0+
Previous · 1 . . . 12 · 13 · 14 · 15 · 16 · 17 · 18 . . . 19 · Next
Author | Message |
---|---|
James W Send message Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
Application version: Rosetta v4.07 windows_x86_64 Device: 3710630, Task: 1129189352, and WU: 1017085044. Name: 6mm7mv4g_3h3_design_COVID-19_SAVE_ALL_OUT_902608_1 Status: Error while computing Exit status: 1 (0x00000001) Unknown error code Incorrect function. (0x1) - exit code 1 (0x1) Could this type of error be caused or contributed to by insufficient host memory (RAM)? |
James W Send message Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
Application version: Rosetta v4.07 windows_intelx86 Device: 1759960, Task: 1130640917, and WU: 1018413849. Name: 9eq5wp3x_3h3_design3_COVID-19_SAVE_ALL_OUT_902888_1_0 Status: Completed and validated Exit status: 0 (0x00000000) Though the task was valid, it did end prematurely because of the following errors: ERROR: Assertion `copy_pose.size() == native.size()` failed. MSG: The reference pose must be the same size as the working pose Good to see got credit for what was done, however. Better than throwing all the crunching out and starting from the beginning. Maybe this particular task type/code will need to be reviewed if this type of error continues. |
James W Send message Joined: 25 Nov 12 Posts: 130 Credit: 1,766,254 RAC: 0 |
Application version: Rosetta v4.07 windows_intelx86 Name: 9ly9pu7b_3h3_design3_COVID-19_SAVE_ALL_OUT_902893_1_0 Same error as above, except host 3710630. Task: 1130645238 and WU: 1018417803. |
rsNeutrino Send message Joined: 22 Mar 20 Posts: 10 Credit: 4,870,239 RAC: 7,963 |
I seem to get these errors en masse, particularly shortly after resuming work after a restart of my machine / BOINC, with and without having hit "pause for 1h" before shutdown. ERROR: Assertion `copy_pose.size() == native.size()` failed. MSG:the reference pose must be the same size as the working pose ERROR:: Exit from: ......srcprotocolsprotein_interface_designfiltersRmsdFilter.cc line: 323 06:28:45 (14172): called boinc_finish(0) Tasks: https://boinc.bakerlab.org/rosetta/result.php?resultid=1131628501 https://boinc.bakerlab.org/rosetta/result.php?resultid=1131588448 https://boinc.bakerlab.org/rosetta/result.php?resultid=1131588449 https://boinc.bakerlab.org/rosetta/result.php?resultid=1131317068 https://boinc.bakerlab.org/rosetta/result.php?resultid=1131002632 https://boinc.bakerlab.org/rosetta/result.php?resultid=1131002634 https://boinc.bakerlab.org/rosetta/result.php?resultid=1131002636 https://boinc.bakerlab.org/rosetta/result.php?resultid=1131002638 Edit: 3 more: https://boinc.bakerlab.org/rosetta/result.php?resultid=1132223866 https://boinc.bakerlab.org/rosetta/result.php?resultid=1132223881 https://boinc.bakerlab.org/rosetta/result.php?resultid=1132223998 |
GLadi Send message Joined: 21 Jan 07 Posts: 3 Credit: 303,172 RAC: 0 |
Is there a safe way to pause WUs and resume them later? I'm asking because I'm getting the same error: ERROR: Assertion `copy_pose.size() == native.size()` failed. MSG:the reference pose must be the same size as the working pose ERROR:: Exit from: ......srcprotocolsprotein_interface_designfiltersRmsdFilter.cc line: 323 This happens after restarting the system. Some WUs end with errors (no points granted), some WUs end at percentage they were before restarting (points partially granted) and some WUs continue to process as it should be. |
torma99 Send message Joined: 16 Feb 20 Posts: 14 Credit: 288,937 RAC: 0 |
I just found Rosetta 4.07 used 2,111,242,240 bytes (1.97 GIGAbytes) before my system crashed (i7-4770K, 8GB). This seems to be just a bit more than expected, so please take a look and fix the problem. For my rig (16 GB of ram) running on 4 cores. It consumes almost the same. 1,9-2,2 GB and does not causes problems, 8 gig can be somewhat small, if you use your browser with some open tabs next to Rosetta. |
rsNeutrino Send message Joined: 22 Mar 20 Posts: 10 Credit: 4,870,239 RAC: 7,963 |
For my rig (16 GB of ram) running on 4 cores. It consumes almost the same. 1,9-2,2 GB and does not causes problems, 8 gig can be somewhat small, if you use your browser with some open tabs next to Rosetta. In my case BOINC is configured so that it can use 80% of 32GB RAM at all times, running with 14 rosetta threads on a Ryzen 1700 with 8 cores and 16 CPU threads available. 15 GB RAM has been sitting empty when the errors occured. Changed to 8 rosetta threads for now... |
Peti Send message Joined: 17 Mar 20 Posts: 5 Credit: 142,053 RAC: 0 |
Hi everyone, I am sometimes seeing very similar error messages. ERROR: std::abs( coordsys_rot.det() - 1.0 ) < 1e-6 ERROR:: Exit from: src/core/pose/symmetry/util.cc line: 884 and ERROR: Assertion `copy_pose.size() == native.size()` failed. MSG:the reference pose must be the same size as the working pose ERROR:: Exit from: src/protocols/protein_interface_design/filters/RmsdFilter.cc line: 323 and https://boinc.bakerlab.org/rosetta/result.php?resultid=1131649536 _64-pc-linux-gnu': free(): invalid pointer: 0x00000000080e29a8 *** *** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': free(): invalid pointer: 0x00000000080e29a8 *** and https://boinc.bakerlab.org/rosetta/result.php?resultid=1131646232 _64-pc-linux-gnu': double free or corruption (!prev): 0x00000000060b5a60 *** *** Error in `../../projects/boinc.bakerlab.org_rosetta/rosetta_4.08_x86_64-pc-linux-gnu': double free or corruption (!prev): 0x00000000060b5a60 *** (maybe these last two were due to bad cpu overclock? or maybe not?) I just found this thread, after I already posted my problems here https://boinc.bakerlab.org/rosetta/forum_thread.php?id=13629&postid=92306#92306 The tasks seems "Completed and validated" on the webpage. Why is that, if there is error? Whom should I tell that my PC might have made mistakes that are unnoticed? I don't want to mix bad results into good data... |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Is there a safe way to pause WUs and resume them later? I'm asking because I'm getting the same error: The BOINC Manager should be able to take care of it. One approach to preserve everything, is just to sleep the machine. If you were wanting to use the machine, the BOINC Manager does have an option to pause. I'd suggest you pause the R@h project, otherwise it fires up the next work unit in the line when you suspect the one that is running. Rosetta Moderator: Mod.Sense |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
Whom should I tell that my PC might have made mistakes that are unnoticed? No need to worry about it. The ProjectTeam can identify any bad results. Rosetta Moderator: Mod.Sense |
Peti Send message Joined: 17 Mar 20 Posts: 5 Credit: 142,053 RAC: 0 |
No need to worry about it. The ProjectTeam can identify any bad results. Thank you, then I don't worry. |
Kipni Send message Joined: 24 Mar 20 Posts: 5 Credit: 323,369 RAC: 0 |
Hello, i've started a few days ago with R@H and i seem to have a lot of computation errors on 2 of my rigs. is this normal? See screenshot below. |
adrianxw Send message Joined: 18 Sep 05 Posts: 653 Credit: 11,840,739 RAC: 225 |
I have had 4 wu's fail recently, 3 after about 40 minutes, the other about 3 hours. I have had more trouble with Rosetta this year than since the project started. I don't know if they have rushed new stuff through for the Corona virus, they might have done so. Wave upon wave of demented avengers march cheerfully out of obscurity into the dream. |
[VENETO] boboviz Send message Joined: 1 Dec 05 Posts: 1991 Credit: 9,520,400 RAC: 12,860 |
Hello, Seems memory problems. How much ram do you have? |
Kipni Send message Joined: 24 Mar 20 Posts: 5 Credit: 323,369 RAC: 0 |
On one machine i have 8GB, on the other machine i have 16GB So for the machine with 8GB you may be right because 90% is used. But on the other machine only 50% memory is used. |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 4,447 |
How much of this memory have you allowed BOINC to use? That's often more important than the total you have. |
Kipni Send message Joined: 24 Mar 20 Posts: 5 Credit: 323,369 RAC: 0 |
These are my memory settings. Or do u mean something else? |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 4,447 |
That looks like suitable memory settings. However, calculate how much 80% or 90% of 8 GB is to see if you should expect memory problems on that computer. You may have to reduce the number of cores BOINC can use at once to avoid memory problems. |
Sid Celery Send message Joined: 11 Feb 08 Posts: 2114 Credit: 41,105,271 RAC: 21,658 |
These are my memory settings. Or do u mean something else? You allow more memory than me - that's not the issue. It was discovered a long time ago that we need to have the "Leave non-GPU tasks in memory while suspended" box ticked, otherwise weird errors crop up. I'm not sure anyone explained why, but ticking it makes problems go away. I don't know why it isn't the default setting. Also, no-one ever tells you about it. Try it and see how it goes for a day. |
robertmiles Send message Joined: 16 Jun 08 Posts: 1232 Credit: 14,269,631 RAC: 4,447 |
I just saw a 24.01 KB zip file being downloaded. 24.0 KB appeared to download at normal speed, then it was several seconds before it downloaded the last 0.01 KB. In other words, the larger zip files aren't fully exempt from the problem; they just aren't affected severely enough to shut down Rosetta@Home new tasks. |
Message boards :
Number crunching :
Rosetta 4.0+
©2024 University of Washington
https://www.bakerlab.org