Page 2 of 2

Re: Read errors on stripe cause hang/crash.

PostPosted: Thu Mar 26, 2020 8:22 am
by zandr
memtest86 passed with no issues. I needed to move ahead with rebuilding the array for my Photo library, but I'll take these two flaky disks and try to reproduce this on another box.

With the replacement disks, I've replicated the dataset back, and am most of the way through a scrub. Fingers Crossed. :D

Re: Read errors on stripe cause hang/crash.

PostPosted: Fri Mar 27, 2020 1:58 pm
by lundman
MCA Error Report ***\nCPU Machine Check Architecture Error Dump

That's the first time I've seen that, and appears to only talk about hardware, not software, so possibly ZFS isn't even involved - it certainly does
not contain a list of kexts like we are used to seeing.

Re: Read errors on stripe cause hang/crash.

PostPosted: Sun Mar 29, 2020 9:17 am
by zandr
lundman wrote:so possibly ZFS isn't even involved


Other than the "completely repeatable by importing the zpool and letting the scrub complete" bit, yeah.

- it certainly does not contain a list of kexts like we are used to seeing.


Because it isn't the same kind of crash. On the stripe, it would just lock up the machine without a panic. On the mirror, it managed to throw this.

I'll be able to do some additional testing this week. Need to get some work done using that hardware first.

Re: Read errors on stripe cause hang/crash.

PostPosted: Wed Apr 01, 2020 11:46 pm
by lundman
You could try disabling all the assembler versions (sysctl, setting the 4? to default/generic) in case it is something with saving registers that go wrong.

Re: Read errors on stripe cause hang/crash.

PostPosted: Mon Jun 08, 2020 7:25 am
by 4ever6
lundman wrote:You could try disabling all the assembler versions (sysctl, setting the 4? to default/generic) in case it is something with saving registers that go wrong.


Curious about this - which sysctls are you referring to?

Re: Read errors on stripe cause hang/crash.

PostPosted: Mon Jun 15, 2020 10:46 pm
by lundman
There are four: zfs_vdev_raidz_impl icp_gcm_impl icp_aes_impl zfs_fletcher_4_impl

You set them to "generic" and hmm, standard? Whatever the most benign sounding one is :)

Re: Read errors on stripe cause hang/crash.

PostPosted: Tue Jun 16, 2020 2:01 am
by jawbroken
I have had two of these same crashes in the last month or so with the same enclosure. I recently upgraded to the latest Catalina versions, after lagging behind a bit. It's suspicious that we all have the same hardware, but given the generic error message it could be any combination of hardware, OS, or OpenZFS.