Problems and Technical Issues with Rosetta@home

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home

To post messages, you must log in.

Previous · 1 . . . 178 · 179 · 180 · 181 · 182 · 183 · 184 . . . 299 · Next

AuthorMessage
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 1
Message 104940 - Posted: 18 Feb 2022, 16:02:11 UTC - in response to Message 104933.  

Kent - you just got the short straw. That's all the movingstubs tasks that everyone is talking about.
As far as we can tell the whole batch was not beta tested first and just tossed on here.
You will get better work a little later.
ID: 104940 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 1
Message 104941 - Posted: 18 Feb 2022, 16:02:47 UTC - in response to Message 104939.  

Lucky me...found bug #2
ID: 104941 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 257
Credit: 483,503
RAC: 397
Message 104942 - Posted: 18 Feb 2022, 16:04:11 UTC - in response to Message 104939.  

Maybe there is a bug with boinc disk space measurements?
Open an issue at https://github.com/BOINC/boinc/issues
ID: 104942 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 1
Message 104943 - Posted: 18 Feb 2022, 16:32:46 UTC - in response to Message 104942.  

Maybe there is a bug with boinc disk space measurements?
Open an issue at https://github.com/BOINC/boinc/issues



New issue opened....
ID: 104943 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ricky@SETI.USA
Avatar

Send message
Joined: 13 Dec 05
Posts: 20
Credit: 97,355
RAC: 0
Message 104944 - Posted: 18 Feb 2022, 17:08:02 UTC - in response to Message 80621.  
Last modified: 18 Feb 2022, 17:13:55 UTC

Just started back running this project and so far I got nothing but Computation Errors on every WU this PC has tried to run within 12 sec of running! Correction make that 25 sec.!
"Life is like an Ice Cream cone, just when you think you got it licked, it drips all over you!"

ID: 104944 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
kotenok2000
Avatar

Send message
Joined: 22 Feb 11
Posts: 257
Credit: 483,503
RAC: 397
Message 104945 - Posted: 18 Feb 2022, 17:10:15 UTC - in response to Message 104944.  

Abort all movingstub tasks. They crash on windows for some reason
ID: 104945 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Ricky@SETI.USA
Avatar

Send message
Joined: 13 Dec 05
Posts: 20
Credit: 97,355
RAC: 0
Message 104946 - Posted: 18 Feb 2022, 17:31:14 UTC - in response to Message 104944.  

After getting all Computation Errors on all the WU's I set my PC to No New Tasks until you folks can figure out why Windows crashes with the "movingstub" Work units
"Life is like an Ice Cream cone, just when you think you got it licked, it drips all over you!"

ID: 104946 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
JohnDK
Avatar

Send message
Joined: 6 Apr 20
Posts: 33
Credit: 2,390,240
RAC: 0
Message 104948 - Posted: 18 Feb 2022, 17:49:11 UTC

Don't like those python WUs, but with the movingstub problems, I will have to continue with them.
ID: 104948 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Jim1348

Send message
Joined: 19 Jan 06
Posts: 881
Credit: 52,257,545
RAC: 0
Message 104951 - Posted: 18 Feb 2022, 21:19:59 UTC

Total queued jobs: 4,144,753

Someone at Rosetta has great faith in our ability to run these, though I don't know why.

But I have found that the pythons do not suspend so much if I run only 50% of the cores.
Or maybe they have changed them.
ID: 104951 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1664
Credit: 17,383,150
RAC: 24,367
Message 104955 - Posted: 18 Feb 2022, 21:51:13 UTC - in response to Message 104951.  

Total queued jobs: 4,144,753

Someone at Rosetta has great faith in our ability to run these, though I don't know why.
2 million Tasks for LINUX systems, Instant crash and burn on Windows.
So it's going to take a while to clear them unless the project pulls them & fixes & then re-issues them, or puts in a flag with the Scheduler to only allocate them to LINUX systems. No hope of the second, very slight hope for the first.
Grant
Darwin NT
ID: 104955 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
.clair.

Send message
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 104958 - Posted: 18 Feb 2022, 22:31:05 UTC

movingstub problems
I have a two almost identical systems
P5Q DeLux motherboard with Q9450 cpu 8GB RAM , win7 , its a crash test dummy . 1*
P5Q DeLux motherboard with Q9550 cpu 8GB RAM . Linux mint , Just another day at the office , crunchin on regardless
Funny old world
----------------------------------------
1* so I feel like being a git
Set cache setting to 10 days and try and trash as many of them as possible :-), until it gets backoffd
Yes that is what I have done
see how many I can get rid of
But , keep an eye on it in case any good work arrives.
ID: 104958 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1664
Credit: 17,383,150
RAC: 24,367
Message 104959 - Posted: 18 Feb 2022, 22:34:30 UTC - in response to Message 104958.  
Last modified: 18 Feb 2022, 22:36:49 UTC

Set cache setting to 10 days and try and trash as many of them as possible :-), until it gets backoffd
Yes that is what I have done
see how many I can get rid of
Not that many (compared to how many there are to get through).
For a particular application, for every error you return, you have the amount of work you can download reduced by 1 until you get to the point you will only be able to get 1 Task per 24 hours.
Returning Valid work increases the amount of work you can download per day.
Grant
Darwin NT
ID: 104959 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
.clair.

Send message
Joined: 2 Jan 07
Posts: 274
Credit: 26,399,595
RAC: 0
Message 104961 - Posted: 18 Feb 2022, 22:49:39 UTC
Last modified: 18 Feb 2022, 22:53:59 UTC

On the win cruncher all its pythons go zombie so I don't let it do them , normal R4.2 is ok
now It will only let me have 29 at a time to trash ,
And I am already getting 4 hours `go away and don't be silly` time.
nnnn poot .
ID: 104961 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1664
Credit: 17,383,150
RAC: 24,367
Message 104963 - Posted: 18 Feb 2022, 22:55:32 UTC - in response to Message 104961.  
Last modified: 18 Feb 2022, 22:55:54 UTC

And I am already getting 4 hours `go away and don't be silly` time.
That will happen every time Tasks error out (it could be anything from 3min to well over 4 hours, the more that error out between Scheduler contacts the larger the backoff tends to be).
Grant
Darwin NT
ID: 104963 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 1
Message 104964 - Posted: 18 Feb 2022, 23:34:08 UTC - in response to Message 104946.  

After getting all Computation Errors on all the WU's I set my PC to No New Tasks until you folks can figure out why Windows crashes with the "movingstub" Work units



Unless one of our experts here reaches out to his contact at UW, there is no one that will see these posts.
So "you folks" is a pointless thing.
ID: 104964 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 1
Message 104965 - Posted: 18 Feb 2022, 23:34:51 UTC

24 movingstubs were in my queue. Aborted them instantly.
Not going to up my error count for stupidness from their end.
ID: 104965 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 1
Message 104966 - Posted: 18 Feb 2022, 23:36:35 UTC - in response to Message 104961.  

.clair. you've done all the different things we as a group have talked about for python?
downgrade boinc and vbox and check your virtualization setting on your motherboard?
If python still dies on you after all that, that is weird.
ID: 104966 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 1
Message 104967 - Posted: 18 Feb 2022, 23:41:18 UTC - in response to Message 104944.  

Ricky - your just getting the movingstubs garbage. Watch your queue and abort them as soon as you see them.
They do not work.

If you want to do Vbox stuff, then you can run python tasks

4.2 is a mishmash of stuff, but movingstubs is trash and rb_02_16_213037 has a bug
Also in python the aagb-PHE stuff is buggy
ID: 104967 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Greg_BE
Avatar

Send message
Joined: 30 May 06
Posts: 5691
Credit: 5,859,226
RAC: 1
Message 104968 - Posted: 18 Feb 2022, 23:41:53 UTC

I have noticed that in 4.2 the rb_02_16_213037..... has a bug
Also in python the aagb-PHE... stuff is buggy
ID: 104968 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Profile Grant (SSSF)

Send message
Joined: 28 Mar 20
Posts: 1664
Credit: 17,383,150
RAC: 24,367
Message 104969 - Posted: 19 Feb 2022, 0:10:27 UTC - in response to Message 104965.  

24 movingstubs were in my queue. Aborted them instantly.
Not going to up my error count for stupidness from their end.
Aborted Tasks count as errors.
Grant
Darwin NT
ID: 104969 · Rating: 0 · rate: Rate + / Rate - Report as offensive    Reply Quote
Previous · 1 . . . 178 · 179 · 180 · 181 · 182 · 183 · 184 . . . 299 · Next

Message boards : Number crunching : Problems and Technical Issues with Rosetta@home



©2024 University of Washington
https://www.bakerlab.org