hanging on 1.9.0

All your general support questions for OpenZFS on OS X.

hanging on 1.9.0

Postby mauricev » Mon Jun 17, 2019 8:03 pm

I have a 3 TB ZFS mirror using zfs 1.9.0 under 10.14.5 and I'm rsyncing files to it and along with running a virtual machine. After a while, I/O hangs. What's going on?
mauricev
 
Posts: 16
Joined: Mon Oct 27, 2014 9:57 pm

Re: hanging on 1.9.0

Postby tangles » Tue Jun 18, 2019 4:45 am

Not much by the sounds of it…

What's going on? who knows…
You switched it off? you closed your eyes?
Sorry for taking the piss… but mate… we're not mind readers.

Please provide a bit more info, here's some suggestions…

Hardware description of Mac and zpool connectivity.
Output of:
zpool status and zpool list so we can see how your pool is setup and what state it's in.
zfs get all on <dataset in question>
zpool iostat -v 1 600 while running rsync to see if any vdev has poor I/O.

By providing the above, the community will have a better chance to help you.

Cheers,
tangles
 
Posts: 149
Joined: Tue Jun 17, 2014 6:54 am

Re: hanging on 1.9.0

Postby mauricev » Tue Jun 18, 2019 6:47 am

Trash can Mac with a JMicron-based 2-bay disk enclosure connected via USB 3.
Code: Select all
 pool: externalhd
 state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
   still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
   the pool may no longer be accessible by software that does not support
   the features. See zpool-features(5) for details.
  scan: scrub in progress since Tue Jun 18 10:26:29 2019
   826G scanned at 810M/s, 26.0G issued at 25.5M/s, 2.41T total
   0 repaired, 1.05% done, 1 days 03:12:32 to go
config:

   NAME                                            STATE     READ WRITE CKSUM
   externalhd                                      ONLINE       0     0     0
     mirror-0                                      ONLINE       0     0     0
       media-A1232949-F65F-A64B-B241-D4DBBA49E0A0  ONLINE       0     0     0
       media-448C5ECE-160D-7446-BF08-3BED3AE018B5  ONLINE       0     0     0

errors: No known data errors

As you can see, I'm running a scrub.

Code: Select all
NAME         SIZE  ALLOC   FREE  CKPOINT  EXPANDSZ   FRAG    CAP  DEDUP  HEALTH  ALTROOT
externalhd  2.72T  2.41T   319G        -         -    16%    88%  1.00x  ONLINE  -


I saw this message on trying to reboot after it hung
Code: Select all
Warning: Pool 'externalhd' has encountered an uncorrectable I/O failure and has been suspended.


Bad USB cable? Bad controller electronics?
mauricev
 
Posts: 16
Joined: Mon Oct 27, 2014 9:57 pm

Re: hanging on 1.9.0

Postby mauricev » Tue Jun 18, 2019 7:14 am

I am separately running a Mac program Drive Genius which is supposed to monitor disk health. It's now reporting that one of the two disks, a Toshiba DT01ACA300, has a significant number of damaged areas. This implies the disk is very sick, but the scrub is still running and not detecting any errors. How are the errors apparent to it, but not zfs? Why should the errors of one disk hang the whole pool?
mauricev
 
Posts: 16
Joined: Mon Oct 27, 2014 9:57 pm

Re: hanging on 1.9.0

Postby sean » Tue Jun 18, 2019 7:36 am

mauricev wrote:...one of the two disks, a Toshiba DT01ACA300, has a significant number of damaged areas. This implies the disk is very sick, but the scrub is still running and not detecting any errors. How are the errors apparent to it, but not zfs?


The point of view from the disk and the filesystem can be very different. Drives can attempt to remap bad blocks, etc. I think the more relevant piece is your scrub is only ~1% complete, so it's premature to think that zfs won't find any errors. I don't think there is much point in waiting around to see if it does, though.

maurice wrote:Why should the errors of one disk hang the whole pool?


It doesn't take many resets and / or timeouts to send drive performance right into the drain, and I suspect you're getting a LOT of them. Since you have a mirror, I would stop the scrub, pull the drive, put in a replacement, resilver, and carry on.
sean
 
Posts: 2
Joined: Sat May 18, 2019 7:25 am

Re: hanging on 1.9.0

Postby lundman » Tue Jun 18, 2019 3:47 pm

"an uncorrectable I/O failure and has been suspended. "


ZFS detected the disk more or less vanished, and was forced to give up - you will not get more data from ZFS after that. You can issue "zpool clear pool" and "zpool clear pool device" to ask it to retry talking to the disk, but it seems likely the disk will glitch again.
User avatar
lundman
 
Posts: 615
Joined: Thu Mar 06, 2014 2:05 pm
Location: Tokyo, Japan

Re: hanging on 1.9.0

Postby mauricev » Thu Jun 20, 2019 7:13 am

I replaced the disk and the pool seems to be working normally.
mauricev
 
Posts: 16
Joined: Mon Oct 27, 2014 9:57 pm


Return to General Help

Who is online

Users browsing this forum: Google [Bot] and 1 guest

cron