Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 300 · Next

AuthorMessage
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 81560 - Posted: 8 Jun 2017, 15:39:01 UTC

I have has four b21 and b22 work units that slowed to a crawl on two Ubuntu machines (both i7-3770) that were set to run for 24 hours. However, they all completed and reported without error after 4 1/2 more hours (28 1/2 hours total). So I would just let them run.
ID: 81560 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
svincent

Send message
Joined: 30 Dec 05
Posts: 219
Credit: 12,120,035
RAC: 0
Message 81564 - Posted: 9 Jun 2017, 1:16:19 UTC

I'm seeing problems with these b* tasks also. This one, b21_ncst_s2_0601.108._fold_and_dock_SAVE_ALL_OUT_486796_197,
830463634 , a big all-sheet hexamer, got 'stuck' (although it was clearly doing something) on Model 2 Step 7205 for at least 6 hours before finally expiring with a Validate error.

Linux/Boinc 7.6.31
ID: 81564 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
googloo
Avatar

Send message
Joined: 15 Sep 06
Posts: 133
Credit: 22,677,186
RAC: 6,531
Message 81569 - Posted: 9 Jun 2017, 20:53:39 UTC

Reporting that the task named .86.kin_propCO_6.5_23.0_7.0_t0153_0010_0001_fold_and_dock_SAVE_ALL_OUT_486512_197_0 ran more than 10 hours; my Target CPU run time is 6 hours.

That completes my report. However, I do not understand why I only got 48.8 credits, when the more normal tasks that take roughly 6 hours always get more than 100.
ID: 81569 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1993
Credit: 9,520,400
RAC: 11,365
Message 81572 - Posted: 10 Jun 2017, 7:34:00 UTC - in response to Message 81564.  

I'm seeing problems with these b* tasks also. ......
Linux/Boinc 7.6.31


Same here problems with b*
Windows 10

ID: 81572 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
svincent

Send message
Joined: 30 Dec 05
Posts: 219
Credit: 12,120,035
RAC: 0
Message 81574 - Posted: 10 Jun 2017, 15:58:41 UTC

Another issue with these overrunning tasks: checkpointing isn't working properly. I've got a couple of b21* tasks that have been running for over 12 hours and are stuck on step 7205 having done 52 and 12 tasks respectively. Yet the last checkpoint for each is 18 minutes.
ID: 81574 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile robertmiles

Send message
Joined: 16 Jun 08
Posts: 1232
Credit: 14,269,631
RAC: 3,846
Message 81576 - Posted: 11 Jun 2017, 17:16:26 UTC - in response to Message 81574.  
Last modified: 11 Jun 2017, 17:21:55 UTC

Another issue with these overrunning tasks: checkpointing isn't working properly. I've got a couple of b21* tasks that have been running for over 12 hours and are stuck on step 7205 having done 52 and 12 tasks respectively. Yet the last checkpoint for each is 18 minutes.

A lot of compute errors lately for workunits with names beginning with b21_ or b22_. For most of them finished by a wingmate, the wingmate gave a compute error also.

Could you check if this series of workunits has built-in errors?

Running under 64-bit Windows 10.

Not overrunning; they hit a compute error when about a quarter done.

Problem not seen for workunits with names beginning with anything else.
ID: 81576 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Patrick T.

Send message
Joined: 10 May 17
Posts: 1
Credit: 220,116
RAC: 0
Message 81587 - Posted: 13 Jun 2017, 14:29:52 UTC

Hi to all, i don't know if this space is right or not, if no please move the right space.

When running rosetta software, minirosetta, by BOINC client, if click on show graphics the application hang and analysis of data hanging on that percentage.

My pc is equipped with windows 10 home.

Best regards
ID: 81587 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Mod.Sense
Volunteer moderator

Send message
Joined: 22 Aug 06
Posts: 4018
Credit: 0
RAC: 0
Message 81588 - Posted: 13 Jun 2017, 15:16:57 UTC

Thanks Patrick for the report. Can you get a link to or the name of the specific task name you were trying to display in the graphic? Have you seen it happen on more than one? In your preferences, what do you have set for "Percentage of CPU time used for graphics"?
Rosetta Moderator: Mod.Sense
ID: 81588 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2115
Credit: 41,107,773
RAC: 19,731
Message 81592 - Posted: 14 Jun 2017, 0:35:36 UTC - in response to Message 81576.  

Another issue with these overrunning tasks: checkpointing isn't working properly. I've got a couple of b21* tasks that have been running for over 12 hours and are stuck on step 7205 having done 52 and 12 tasks respectively. Yet the last checkpoint for each is 18 minutes.

A lot of compute errors lately for workunits with names beginning with b21_ or b22_. For most of them finished by a wingmate, the wingmate gave a compute error also.

Could you check if this series of workunits has built-in errors?

Running under 64-bit Windows 10.

Not overrunning; they hit a compute error when about a quarter done.

Problem not seen for workunits with names beginning with anything else.

Same here under Windows7-64bit

b21_ncst_s2_0601.154._fold_and_dock_SAVE_ALL_OUT_486842_154_1
Crashed out midway through 4hrs into an 8hr run

<message>
(unknown error) - exit code -1073741819 (0xc0000005)
</message>

...

Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x00A6CEF9 read attempt to address 0x2BAA4000

Engaging BOINC Windows Runtime Debugger...


b22_1_0603.39._fold_and_dock_SAVE_ALL_OUT_487004_245_0
Over-ran - watchdog cut in

BOINC:: CPU time: 43437.5s, 14400s + 28800s[2017- 6-13 14:10:10:] :: BOINC

...

Validate state Valid
Claimed credit 397.853916996678
Granted credit 177.706745808389


ID: 81592 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Sid Celery

Send message
Joined: 11 Feb 08
Posts: 2115
Credit: 41,107,773
RAC: 19,731
Message 81594 - Posted: 14 Jun 2017, 15:19:18 UTC
Last modified: 14 Jun 2017, 15:19:42 UTC

Is this another over-runninglow-credit task type?

.92.kin_propCO_6.5_23.0_7.0_t0154_0005_0005_fold_and_dock_SAVE_ALL_OUT_486518_359_0
BOINC:: CPU time: 43783.5s, 14400s + 28800s[2017- 6-12 22:20:20:] :: BOINC

...

Validate state Valid

Claimed credit 263.27551784022
Granted credit 40.2171725261211

ID: 81594 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1993
Credit: 9,520,400
RAC: 11,365
Message 86700 - Posted: 23 Jun 2017, 7:56:34 UTC

Please,
- insert links "Home | Join | About | Participants | Community | Statistics" in the footer of forum, like old site
- insert "last modified" to know updates of forum

Do you plan to update Ralph@Home server??
ID: 86700 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 86703 - Posted: 23 Jun 2017, 9:22:38 UTC - in response to Message 86700.  

Those are up at the top now. It looks very nice to me.
ID: 86703 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile [VENETO] boboviz

Send message
Joined: 1 Dec 05
Posts: 1993
Credit: 9,520,400
RAC: 11,365
Message 86704 - Posted: 23 Jun 2017, 9:43:09 UTC - in response to Message 86703.  
Last modified: 23 Jun 2017, 9:45:32 UTC

Those are up at the top now. It looks very nice to me.


Yes, but the last message is at the end of the page and if i want to return to home page....

P.S. Very good the possibility to see, in server status page, the queue of android wus...
ID: 86704 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 86705 - Posted: 23 Jun 2017, 11:55:04 UTC - in response to Message 86704.  
Last modified: 23 Jun 2017, 11:55:54 UTC

Yes, but the last message is at the end of the page and if i want to return to home page....

I am beginning to get into the habit of listing the newest messages first, since there are a lot of long discussions (including this one and its predecessor).
So it is about the same for me, depending on the forum. Both might be nice, but I am happy enough as it is.
ID: 86705 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Defender

Send message
Joined: 22 Mar 08
Posts: 10
Credit: 13,517,861
RAC: 1,351
Message 86707 - Posted: 23 Jun 2017, 17:25:59 UTC
Last modified: 23 Jun 2017, 17:26:15 UTC

Can you add the server details to the server status page? We are very interested in the hardware.
ID: 86707 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 86710 - Posted: 23 Jun 2017, 19:47:22 UTC - in response to Message 86707.  

done!
ID: 86710 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 86711 - Posted: 23 Jun 2017, 19:51:51 UTC - in response to Message 86700.  

Please,
- insert links "Home | Join | About | Participants | Community | Statistics" in the footer of forum, like old site
- insert "last modified" to know updates of forum

Do you plan to update Ralph@Home server??



I'll add these links soon.

We do plan on updating Ralph but probably later in July or Aug since I'll be on vacation soon.
ID: 86711 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JohnntH

Send message
Joined: 8 Jan 07
Posts: 2
Credit: 30,254,638
RAC: 443
Message 86715 - Posted: 24 Jun 2017, 15:07:36 UTC

Not sure if this is correct thread or not, so please move it if I am wrong. I love the new changes and have just one question, I can not recall if you are pushing or if the combined sites are pulling but BOINCstats, Free DC, and BOINC Combined Statistics have not updated since the upgrade. Also not sure if BOINC Statistics for the WORLD! still works as it gives an error message when trying to connect from both here and at least one other project. I know this should be low on priority list, but it is the accountant in me not liking the lack of balance, I know I just added to the stereotype of accountants, but some stereotypes have a grain of truth in them.
Thank You
ID: 86715 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
entigy

Send message
Joined: 2 Nov 05
Posts: 5
Credit: 990,830
RAC: 0
Message 86716 - Posted: 24 Jun 2017, 15:09:01 UTC

Willy from BoincStats is reporting that your "Stats exports directory is empty", hence we're not getting any credit for the results we've returned in the past couple of days...
ID: 86716 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile David E K
Volunteer moderator
Project administrator
Project developer
Project scientist

Send message
Joined: 1 Jul 05
Posts: 1018
Credit: 4,334,829
RAC: 0
Message 86717 - Posted: 24 Jun 2017, 15:12:40 UTC - in response to Message 86716.  

Willy from BoincStats is reporting that your "Stats exports directory is empty", hence we're not getting any credit for the results we've returned in the past couple of days...


I'll look into this later today.
ID: 86717 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 11 · 12 · 13 · 14 · 15 · 16 · 17 . . . 300 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org