Checksums don't match

A forum just for SPCR's folding team... by request.

Moderators: NeilBlanchard, Ralf Hutter, sthayashi, Lawrence Lee

Post Reply
wainwra
*Lifetime Patron*
Posts: 78
Joined: Wed Sep 28, 2005 5:24 am
Location: Starnberg, Germany

Checksums don't match

Post by wainwra » Wed Oct 05, 2005 7:22 pm

Well, I seem to be having a difficult introduction to Folding@Home. As mentioned in a previous topic I had a checksum problem, close to the end of my first (and very long) project. It restarted from scratch, and several days later, eventually finished.

Last night I was expecting to finish my second project, but checking I see I've been having checksum problems:

Code: Select all

[19:40:06] Completed 395000 out of 500000 steps  (79)
[19:55:06] Timered checkpoint triggered.
[20:10:06] Timered checkpoint triggered.
[20:23:21] Writing local files
[20:23:21] Completed 400000 out of 500000 steps  (80)
[20:23:21] - Checksums don't match (work/wudata_02.xtc)
[20:23:22] Premature end of file when checksumming (636284 bytes left)
[20:23:22] - Could not calculate checksum (work/wudata_02.xtc)
[20:23:23] Checksum not what expected.
[20:23:23] 
[20:23:23] Folding@home Core Shutdown: FILE_IO_ERROR
[20:23:26] CoreStatus = 75 (117)
[20:23:26] Error opening or reading from a file.
[20:23:26] Deleting current work unit & continuing...
[20:23:30] Trying to send all finished work units
[20:23:30] + No unsent completed units remaining.
[20:23:30] - Preparing to get new work unit...
[20:23:30] + Attempting to get work packet
[20:23:30] - Will indicate memory of 1022 MB.
[20:23:30] - Connecting to assignment server
[20:23:30] Connecting to http://assign.stanford.edu:8080/
[20:23:31] Posted data.
[20:23:31] Initial: 40AB; - Successful: assigned to (171.64.122.127).
[20:23:31] + News From Folding@Home: Welcome to Folding@Home
[20:23:31] Loaded queue successfully.
[20:23:31] Connecting to http://171.64.122.127:8080/
[20:23:32] Posted data.
[20:23:32] Initial: 0000; - Receiving payload (expected size: 209214)
[20:23:34] - Downloaded at ~102 kB/s
[20:23:34] - Averaged speed for that direction ~76 kB/s
[20:23:34] + Received work.
[20:23:34] + Closed connections
[20:23:39] 
[20:23:39] + Processing work unit
[20:23:39] Core required: FahCore_78.exe
[20:23:39] Core found.
[20:23:39] Working on Unit 03 [October 5 20:23:39]
[20:23:39] + Working ...
[20:23:39] - Calling 'FahCore_78.exe -dir work/ -suffix 03 -checkpoint 15 -service -forceasm -verbose -lifeline 1612 -version 502'

[20:23:39] 
[20:23:39] *------------------------------*
[20:23:39] Folding@Home Gromacs Core
[20:23:39] Version 1.86 (August 28, 2005)
[20:23:39] 
[20:23:39] Preparing to commence simulation
[20:23:39] - Assembly optimizations manually forced on.
[20:23:39] - Not checking prior termination.
[20:23:40] - Expanded 208702 -> 1016269 (decompressed 486.9 percent)
[20:23:40] - Data doesn't match checksum.
[20:23:40] - Starting from initial work packet
[20:23:40] 
[20:23:40] Project: 246 (Run 5, Clone 45, Gen 104)
[20:23:40] 
[20:23:40] Assembly optimizations on if available.
[20:23:40] Entering M.D.
[20:23:46] Protein: p246_vil0MUreGS
[20:23:46] 
[20:23:46] Writing local files
[20:23:46] Size of work/wudata_03.bed not what saved.
[20:23:46] 
[20:23:46] Folding@home Core Shutdown: FILE_IO_ERROR
[20:23:49] CoreStatus = 75 (117)
[20:23:49] Error opening or reading from a file.
[20:23:49] Deleting current work unit & continuing...
[20:23:53] Trying to send all finished work units
[20:23:53] + No unsent completed units remaining.
[20:23:53] - Preparing to get new work unit...
[20:23:53] + Attempting to get work packet
[20:23:53] - Will indicate memory of 1022 MB.
[20:23:53] - Connecting to assignment server
[20:23:53] Connecting to http://assign.stanford.edu:8080/
[20:23:54] Posted data.
[20:23:54] Initial: 40AB; - Successful: assigned to (171.64.122.127).
[20:23:54] + News From Folding@Home: Welcome to Folding@Home
[20:23:54] Loaded queue successfully.
[20:23:54] Connecting to http://171.64.122.127:8080/
[20:23:55] Posted data.
[20:23:55] Initial: 0000; - Receiving payload (expected size: 209214)
[20:24:00] - Downloaded at ~40 kB/s
[20:24:00] - Averaged speed for that direction ~67 kB/s
[20:24:00] + Received work.
[20:24:00] + Closed connections
[20:24:05] 
[20:24:05] + Processing work unit
[20:24:05] Core required: FahCore_78.exe
[20:24:05] Core found.
[20:24:05] Working on Unit 04 [October 5 20:24:05]
[20:24:05] + Working ...
[20:24:05] - Calling 'FahCore_78.exe -dir work/ -suffix 04 -checkpoint 15 -service -forceasm -verbose -lifeline 1612 -version 502'

[20:24:05] 
[20:24:05] *------------------------------*
[20:24:05] Folding@Home Gromacs Core
[20:24:05] Version 1.86 (August 28, 2005)
[20:24:05] 
[20:24:05] Preparing to commence simulation
[20:24:05] - Assembly optimizations manually forced on.
[20:24:05] - Not checking prior termination.
[20:24:05] - Expanded 208702 -> 1016269 (decompressed 486.9 percent)
[20:24:05] - Starting from initial work packet
[20:24:05] 
[20:24:05] Project: 246 (Run 5, Clone 45, Gen 104)
[20:24:05] 
[20:24:05] Assembly optimizations on if available.
[20:24:05] Entering M.D.
[20:24:12] Protein: p246_vil0MUreGS
[20:24:12] 
[20:24:12] Writing local files
[20:24:12] Extra SSE boost OK.
[20:24:12] Writing local files
[20:24:12] Completed 0 out of 1000000 steps  (0)
[20:39:13] Timered checkpoint triggered.
[20:41:45] Writing local files
[20:41:45] Completed 10000 out of 1000000 steps  (1)
and again:

Code: Select all

[23:16:35] Completed 100000 out of 1000000 steps  (10)
[22:59:10] Completed 90000 out of 1000000 steps  (9)
[23:14:09] Timered checkpoint triggered.
[23:16:35] Writing local files
[23:16:35] Completed 100000 out of 1000000 steps  (10)
[23:16:35] - Checksums don't match (work/wudata_04.xtc)
[23:16:36] Premature end of file when checksumming (30684 bytes left)
[23:16:36] - Could not calculate checksum (work/wudata_04.xtc)
[23:16:37] Checksum not what expected.
[23:16:37] 
[23:16:37] Folding@home Core Shutdown: FILE_IO_ERROR
[23:16:41] CoreStatus = 75 (117)
[23:16:41] Error opening or reading from a file.
[23:16:41] Deleting current work unit & continuing...
[23:16:45] Trying to send all finished work units
[23:16:45] + No unsent completed units remaining.
[23:16:45] - Preparing to get new work unit...
[23:16:45] + Attempting to get work packet
[23:16:45] - Will indicate memory of 1022 MB.
[23:16:45] - Connecting to assignment server
[23:16:45] Connecting to http://assign.stanford.edu:8080/
[23:16:46] Posted data.
[23:16:46] Initial: 40AB; - Successful: assigned to (171.64.122.124).
[23:16:46] + News From Folding@Home: Welcome to Folding@Home
[23:16:46] Loaded queue successfully.
[23:16:46] Connecting to http://171.64.122.124:8080/
[23:16:48] Posted data.
[23:16:49] Initial: 0000; - Receiving payload (expected size: 210891)
[23:17:02] - Downloaded at ~15 kB/s
[23:17:02] - Averaged speed for that direction ~57 kB/s
[23:17:02] + Received work.
[23:17:02] + Closed connections
[23:17:07] 
[23:17:07] + Processing work unit
[23:17:07] Core required: FahCore_78.exe
[23:17:07] Core found.
[23:17:07] Working on Unit 05 [October 5 23:17:07]
[23:17:07] + Working ...
[23:17:07] - Calling 'FahCore_78.exe -dir work/ -suffix 05 -checkpoint 15 -service -forceasm -verbose -lifeline 1612 -version 502'

[23:17:08] 
[23:17:08] *------------------------------*
[23:17:08] Folding@Home Gromacs Core
[23:17:08] Version 1.86 (August 28, 2005)
[23:17:08] 
[23:17:08] Preparing to commence simulation
[23:17:08] - Assembly optimizations manually forced on.
[23:17:08] - Not checking prior termination.
[23:17:08] - Expanded 210379 -> 1005685 (decompressed 478.0 percent)
[23:17:08] - Data doesn't match checksum.
[23:17:08] - Starting from initial work packet
[23:17:08] 
[23:17:08] Project: 233 (Run 0, Clone 64, Gen 113)
[23:17:08] 
[23:17:09] Assembly optimizations on if available.
[23:17:09] Entering M.D.
[23:17:15] Protein: p233_vil1MUre99p
[23:17:15] 
[23:17:15] Writing local files
[23:17:15] Size of work/wudata_05.bed not what saved.
[23:17:15] 
[23:17:15] Folding@home Core Shutdown: FILE_IO_ERROR
[23:17:18] CoreStatus = 75 (117)
[23:17:18] Error opening or reading from a file.
[23:17:18] Deleting current work unit & continuing...
[23:17:22] Trying to send all finished work units
[23:17:22] + No unsent completed units remaining.
[23:17:22] - Preparing to get new work unit...
[23:17:22] + Attempting to get work packet
[23:17:22] - Will indicate memory of 1022 MB.
[23:17:22] - Connecting to assignment server
[23:17:22] Connecting to http://assign.stanford.edu:8080/
[23:17:23] Posted data.
[23:17:23] Initial: 40AB; - Successful: assigned to (171.64.122.124).
[23:17:23] + News From Folding@Home: Welcome to Folding@Home
[23:17:23] Loaded queue successfully.
[23:17:23] Connecting to http://171.64.122.124:8080/
[23:17:25] Posted data.
[23:17:25] Initial: 0000; - Receiving payload (expected size: 210891)
[23:17:34] - Downloaded at ~22 kB/s
[23:17:34] - Averaged speed for that direction ~50 kB/s
[23:17:34] + Received work.
[23:17:34] + Closed connections
[23:17:39] 
[23:17:39] + Processing work unit
[23:17:39] Core required: FahCore_78.exe
[23:17:39] Core found.
[23:17:39] Working on Unit 06 [October 5 23:17:39]
[23:17:39] + Working ...
[23:17:39] - Calling 'FahCore_78.exe -dir work/ -suffix 06 -checkpoint 15 -service -forceasm -verbose -lifeline 1612 -version 502'

[23:17:39] 
[23:17:39] *------------------------------*
[23:17:39] Folding@Home Gromacs Core
[23:17:39] Version 1.86 (August 28, 2005)
[23:17:39] 
[23:17:39] Preparing to commence simulation
[23:17:39] - Assembly optimizations manually forced on.
[23:17:39] - Not checking prior termination.
[23:17:39] - Expanded 210379 -> 1005685 (decompressed 478.0 percent)
[23:17:39] - Starting from initial work packet
[23:17:39] 
[23:17:39] Project: 233 (Run 0, Clone 64, Gen 113)
[23:17:39] 
[23:17:39] Assembly optimizations on if available.
[23:17:39] Entering M.D.
[23:17:46] Protein: p233_vil1MUre99p
[23:17:46] 
[23:17:46] Writing local files
[23:17:46] Extra SSE boost OK.
[23:17:46] Writing local files
[23:17:46] Completed 0 out of 1000000 steps  (0)
[23:32:47] Timered checkpoint triggered.
[23:35:56] Writing local files
[23:35:56] Completed 10000 out of 1000000 steps  (1)
and -sigh- again

Code: Select all

[02:00:00] Completed 90000 out of 1000000 steps  (9)
[02:15:01] Timered checkpoint triggered.
[02:25:17] Writing local files
[02:25:17] Completed 100000 out of 1000000 steps  (10)
[02:25:17] - Checksums don't match (work/wudata_06.xtc)
[02:25:18] Premature end of file when checksumming (29416 bytes left)
[02:25:18] - Could not calculate checksum (work/wudata_06.xtc)
[02:25:19] Checksum not what expected.
[02:25:20] 
[02:25:20] Folding@home Core Shutdown: FILE_IO_ERROR
[02:25:22] CoreStatus = 75 (117)
[02:25:22] Error opening or reading from a file.
[02:25:22] Deleting current work unit & continuing...
[02:25:29] Trying to send all finished work units
[02:25:29] + No unsent completed units remaining.
[02:25:29] - Preparing to get new work unit...
[02:25:29] + Attempting to get work packet
[02:25:29] - Will indicate memory of 1022 MB.
[02:25:29] - Connecting to assignment server
[02:25:29] Connecting to http://assign.stanford.edu:8080/
[02:25:31] Posted data.
[02:25:31] Initial: 40AB; - Successful: assigned to (171.64.122.124).
[02:25:31] + News From Folding@Home: Welcome to Folding@Home
[02:25:31] Loaded queue successfully.
[02:25:31] Connecting to http://171.64.122.124:8080/
[02:25:33] Posted data.
[02:25:33] Initial: 0000; - Receiving payload (expected size: 210891)
[02:25:35] - Downloaded at ~102 kB/s
[02:25:35] - Averaged speed for that direction ~60 kB/s
[02:25:35] + Received work.
[02:25:35] + Closed connections
[02:25:40] 
[02:25:40] + Processing work unit
[02:25:40] Core required: FahCore_78.exe
[02:25:40] Core found.
[02:25:40] Working on Unit 07 [October 6 02:25:40]
[02:25:40] + Working ...
[02:25:40] - Calling 'FahCore_78.exe -dir work/ -suffix 07 -checkpoint 15 -service -forceasm -verbose -lifeline 1612 -version 502'

[02:25:40] 
[02:25:40] *------------------------------*
[02:25:40] Folding@Home Gromacs Core
[02:25:40] Version 1.86 (August 28, 2005)
[02:25:40] 
[02:25:40] Preparing to commence simulation
[02:25:40] - Assembly optimizations manually forced on.
[02:25:40] - Not checking prior termination.
[02:25:41] - Expanded 210379 -> 1005685 (decompressed 478.0 percent)
[02:25:41] - Data doesn't match checksum.
[02:25:41] - Starting from initial work packet
[02:25:41] 
[02:25:41] Project: 233 (Run 0, Clone 64, Gen 113)
[02:25:41] 
[02:25:41] Assembly optimizations on if available.
[02:25:41] Entering M.D.
[02:25:48] Protein: p233_vil1MUre99p
[02:25:48] 
[02:25:48] Writing local files
[02:25:48] Size of work/wudata_07.bed not what saved.
[02:25:48] 
[02:25:48] Folding@home Core Shutdown: FILE_IO_ERROR
[02:25:50] CoreStatus = 75 (117)
[02:25:50] Error opening or reading from a file.
[02:25:50] Deleting current work unit & continuing...
[02:25:54] Trying to send all finished work units
[02:25:54] + No unsent completed units remaining.
[02:25:54] - Preparing to get new work unit...
[02:25:54] + Attempting to get work packet
[02:25:54] - Will indicate memory of 1022 MB.
[02:25:54] - Connecting to assignment server
[02:25:54] Connecting to http://assign.stanford.edu:8080/
[02:25:55] Posted data.
[02:25:55] Initial: 40AB; - Successful: assigned to (171.64.122.124).
[02:25:55] + News From Folding@Home: Welcome to Folding@Home
[02:25:55] Loaded queue successfully.
[02:25:55] Connecting to http://171.64.122.124:8080/
[02:25:57] Posted data.
[02:25:57] Initial: 0000; - Receiving payload (expected size: 210891)
[02:25:59] - Downloaded at ~102 kB/s
[02:25:59] - Averaged speed for that direction ~69 kB/s
[02:25:59] + Received work.
[02:25:59] + Closed connections
[02:26:04] 
[02:26:04] + Processing work unit
[02:26:04] Core required: FahCore_78.exe
[02:26:04] Core found.
[02:26:04] Working on Unit 08 [October 6 02:26:04]
[02:26:04] + Working ...
[02:26:04] - Calling 'FahCore_78.exe -dir work/ -suffix 08 -checkpoint 15 -service -forceasm -verbose -lifeline 1612 -version 502'

[02:26:04] 
[02:26:04] *------------------------------*
[02:26:04] Folding@Home Gromacs Core
[02:26:04] Version 1.86 (August 28, 2005)
[02:26:04] 
[02:26:04] Preparing to commence simulation
[02:26:04] - Assembly optimizations manually forced on.
[02:26:04] - Not checking prior termination.
[02:26:05] - Expanded 210379 -> 1005685 (decompressed 478.0 percent)
[02:26:05] - Starting from initial work packet
[02:26:05] 
[02:26:05] Project: 233 (Run 0, Clone 64, Gen 113)
[02:26:05] 
[02:26:05] Assembly optimizations on if available.
[02:26:05] Entering M.D.
[02:26:11] Protein: p233_vil1MUre99p
[02:26:11] 
[02:26:11] Writing local files
[02:26:12] Extra SSE boost OK.
[02:26:12] Writing local files
[02:26:12] Completed 0 out of 1000000 steps  (0)
[02:41:17] Timered checkpoint triggered.
I checked back and found this one reference, a post that NeilBlanchard made where it looked as though the cause was dodgy RAM (is that right?)

I'm running on a standard IBM Thinkpad (R50p, 1Gb RAM), definitely not overclocked.

JanW
Posts: 296
Joined: Fri Dec 03, 2004 12:38 pm
Location: France, Europe Folding for SPCR

Re: Checksums don't match

Post by JanW » Sun Oct 16, 2005 3:52 pm

wainwra wrote:...I've been having checksum problems:

Code: Select all

[20:23:23] Folding@home Core Shutdown: FILE_IO_ERROR
Sorry for suggesting the obvious, but you do have enough disk space left, right? There are probably other potential reasons this happens, but I'm not knowledgeable enough to think of any.

Straker
Posts: 657
Joined: Fri Jul 23, 2004 11:10 pm
Location: AB, Canada
Contact:

Post by Straker » Sun Oct 16, 2005 7:20 pm

assuming you have enough space, and aren't doing anything silly like constantly background defragging, try searching F@H forums other places?
might be worth running scandisk etc too.

I've never had problems with overclocking nor RAM (my old PC had 2x256mb and 1x512mb DIMMs, and at least one mismatched brand), F@H seems pretty damage resistant.

wainwra
*Lifetime Patron*
Posts: 78
Joined: Wed Sep 28, 2005 5:24 am
Location: Starnberg, Germany

Ahem

Post by wainwra » Mon Oct 17, 2005 4:03 am

I thought at one stage that this was caused by having two fah-core's running. (I had initially installed the GUI version by mistake, and after removing that and installing the console version, I found that the GUI version was re-starting itself on boot.)

However, that doesn't make too much sense, given that people with dual core machines are running two of these things successfully.

I think :oops: the most likely candidate is disk space. I hadn't had any error messages (from any other product), but I did have something running that was handling large files, and it's possible that there may have been times when disk space was low.

Anyway, I've addressed both issues now, and have been productively processing proteins ever since.

Tibors
Patron of SPCR
Posts: 2674
Joined: Sun Jul 04, 2004 6:07 am
Location: Houten, The Netherlands, Europe

Post by Tibors » Mon Oct 17, 2005 5:15 pm

Sometimes (software) firewalls cause this type of problem. (IIRC Norton was often the culprit.)
For more info look at http://forum.folding-community.org/.

JanW
Posts: 296
Joined: Fri Dec 03, 2004 12:38 pm
Location: France, Europe Folding for SPCR

Post by JanW » Wed Oct 19, 2005 10:34 am

Sorry, got to go offtopic for a sec...

Welcome back to folding, Tibors!!!!!!!

I remember you saying that electricity would need to become cheaper before you started folding again. Hard to believe that should have happened... Anyway, glad to have you back on the team! Of course I lowly profited from your absence to sneak by :twisted: but I might not be able to hold you off if you choose to add back more than the one machine you have folding right now. :wink:

Tibors
Patron of SPCR
Posts: 2674
Joined: Sun Jul 04, 2004 6:07 am
Location: Houten, The Netherlands, Europe

Post by Tibors » Wed Oct 19, 2005 11:45 am

There is a sad reason for me to be folding again. As you can see, I have no longer a system 1 in my sig. The mobo in my Hush mini-ITX system died. The cap right next to the network connector has brown stuff comming out of it and several other caps are swollen. That was the machine I used for everyday work, so it was on most of the day, but way to slow to use for folding.

Now I am using system 3 in my sig for the day to day work. That machine can do some usefull folding work. Folding increases the powerconsumption of that system (without monitor) from ~42W to ~60W. I can afford that 18W difference even with the current electricity prices. I'm definately not planning to turn my intel machine back on 24/7 with the current prices.

Post Reply