Questions and Answers : Windows : Work restarting from scratch on reboot.
Author | Message |
---|---|
Brian S. Wilson Send message Joined: 2 Dec 12 Posts: 4 Credit: 97,591 RAC: 0 |
My system has to restart as certain times and I suspend the Rosetta@home project then do the restart. All work (many hours of computations)that was done is gone when the system restarts. I thought the project was supposed to write their data every so often so a restart wouldn't lose everything. Any ideas on what the issue might be? I'll check back to see responses as my email isn't working well at the moment. I've got a Windows XP Pro 32 Bit (SP3) system with 4GB memory, plenty of free disk space, and /temp and my browser cache on a ram drive. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
There is a setting in the compute preferences for "task checkpoint to disk at most every ... seconds". This value is intended to avoid tasks utilizing too much disk IO. You cannot force an application to take a checkpoint, but if you entered a very large value in that entry, you are perhaps preventing it from checkpointing when it would be able to. Typically tasks checkpoint several times an hour, but it depends on the type of task. Some can take over an hour to reach a point that a checkpoint can be recorded. You can look at the properties of each task and see the current CPU time and the CPU time at which the last checkpoint was taken. Other than leaving the machine on for longer periods of time, there really isn't much else you can do to effect the behavior. Every time you exit BOINC or power off, you will lose some work, it's just a question of how much. Rosetta Moderator: Mod.Sense |
Brian S. Wilson Send message Joined: 2 Dec 12 Posts: 4 Credit: 97,591 RAC: 0 |
There is a setting in the compute preferences for "task checkpoint to disk at most every ... seconds". Yes, I apologize for not including this value in my earlier issue statement. It is set to 60 seconds on my system. |
Brian S. Wilson Send message Joined: 2 Dec 12 Posts: 4 Credit: 97,591 RAC: 0 |
Typically tasks checkpoint several times an hour, but it depends on the type of task. Some can take over an hour to reach a point that a checkpoint can be recorded. You can look at the properties of each task and see the current CPU time and the CPU time at which the last checkpoint was taken. I was wondering where the checkpoints are saved. I've always been assuming it was on my C: drive (as I've got 10GB allocated for BOINC); but I don't see where this is specified in the preferences and I don't know where the various configuration files are kept, what they are called, or if they are human readable. |
Mod.Sense Volunteer moderator Send message Joined: 22 Aug 06 Posts: 4018 Credit: 0 RAC: 0 |
When you installed BOINC it prompted you for a path name under which all of the applications and tasks are stored. You can see the path that was select in the first set of messages in the log file as BOINC starts. Under that path, there will be a slots directory. A "slot" is a task that is in progress. Each task has it's own slot until it has ended. Since checkpoints are associated with specific tasks, they are stored in the slot directory for the task they pertain to. Rosetta Moderator: Mod.Sense |
vakobo Send message Joined: 3 Aug 08 Posts: 18 Credit: 13,906,353 RAC: 621 |
I have default setting for task checkpoints (60 s), but some tasks do not have checkpoint when i see information about those tasks. If i shutdown my computer and turn it on next time those tasks will start from 0%. This is very sad. Especially when task was more then 90% ready. |
Questions and Answers :
Windows :
Work restarting from scratch on reboot.
©2024 University of Washington
https://www.bakerlab.org