f3 - Asus E35M1-I - FreeNAS
Hardware info on main page.
This machine runs the 64-bit (amd64) version of FreeNAS.
FreeNAS build: FreeNAS-8.2.0-RELEASE-p1-x64 (r11950)
Physical to logical disk mapping (only valid if all disks are present):
SATA6: ada3 SATA1: ada0 SATA5: ada4 SATA2: ada1 SATA4: ada5 SATA3: ada2
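The mapping can also be kept as a small lookup table, for example to find which SATA port to open when a given adaN device misbehaves. This is only a minimal Python sketch; the names PORT_TO_DEVICE, DEVICE_TO_PORT and port_for are made up for illustration, and the table is only valid under the same condition as above (all disks present).

```python
# Physical SATA port -> logical FreeBSD device, copied from the mapping above.
PORT_TO_DEVICE = {
    "SATA1": "ada0", "SATA2": "ada1", "SATA3": "ada2",
    "SATA4": "ada5", "SATA5": "ada4", "SATA6": "ada3",
}
# Reverse direction: device -> port.
DEVICE_TO_PORT = {dev: port for port, dev in PORT_TO_DEVICE.items()}


def port_for(device):
    """Return the SATA port for a disk or partition name such as 'ada1p2'."""
    disk = device.split("p")[0]  # 'ada1p2' -> 'ada1', 'ada1' stays 'ada1'
    return DEVICE_TO_PORT[disk]


print(port_for("ada1p2"))
```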
Hints: the web gui admin user is 'admin'. No, it doesn't have the default password.
work log
2019-07-28: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: DEGRADED
status: One or more devices has been taken offline by the administrator.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Online the device using 'zpool online' or replace the device with
        'zpool replace'.
 scrub: scrub completed after 6h7m with 0 errors on Sun Jul 28 06:07:59 2019
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      DEGRADED     0     0     0
          raidz1    DEGRADED     0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  OFFLINE      0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
I really should strip down the machine and clean it, re-seat all cables and so on.
2019-07-07: old drive ada1 - the offline test was fine too.
2019-07-06: I took out the new (ada1) drive, put it in machine c1, and attempted a 'zpool labelclear' on the drive there, but it failed to read a label from it. I then checked the old drive, which had a label and was active on zstore. A 'smartctl -H ..' on the old drive showed that it was fine, and a short test was also ok, so now I'm running an offline test. It looks like the problem might be cabling or connectivity, not the drive itself.
2019-06-23: ada1 - replace. From the GUI, I select Volume status, ada1p2, replace, replace disk. But that failed. Messages
Jun 23 16:42:40 kg-f3 notifier: dd: /dev/ada1: Input/output error
Jun 23 16:42:40 kg-f3 notifier: 1+0 records in
Jun 23 16:42:40 kg-f3 notifier: 0+0 records out
Jun 23 16:42:40 kg-f3 notifier: 0 bytes transferred in 1.162961 secs (0 bytes/sec)
Jun 23 16:42:41 kg-f3 notifier: dd: /dev/ada1: Input/output error
Jun 23 16:42:41 kg-f3 notifier: 1+0 records in
Jun 23 16:42:41 kg-f3 notifier: 0+0 records out
Jun 23 16:42:41 kg-f3 notifier: 0 bytes transferred in 0.947624 secs (0 bytes/sec)
Jun 23 16:42:49 kg-f3 manage.py: [middleware.exceptions:38] [MiddlewareError: Disk replacement failed: "cannot replace ada1p2 with gptid/28255273-95c5-11e9-b48f-f46d0473b954: I/O error, "]
try again
Jun 23 16:44:40 kg-f3 notifier: 0+0 records out
Jun 23 16:44:40 kg-f3 notifier: 0 bytes transferred in 1.151897 secs (0 bytes/sec)
Jun 23 16:44:42 kg-f3 notifier: dd: /dev/ada1: Input/output error
Jun 23 16:44:42 kg-f3 notifier: 1+0 records in
Jun 23 16:44:42 kg-f3 notifier: 0+0 records out
Jun 23 16:44:42 kg-f3 notifier: 0 bytes transferred in 1.655303 secs (0 bytes/sec)
Jun 23 16:44:46 kg-f3 root: ZFS: vdev failure, zpool=zstore type=vdev.bad_label
Jun 23 16:44:47 kg-f3 notifier: warning: device 'ada1p2' onlined, but remains in faulted state
Jun 23 16:44:47 kg-f3 notifier: use 'zpool replace' to replace devices that are no longer present
Jun 23 16:44:52 kg-f3 notifier: swapon: /dev/ada1p1: device already in use
ok, this isn't going to be easy. From cli
tingo@kg-f3$ zpool status
  pool: zstore
 state: DEGRADED
status: One or more devices could not be used because the label is missing or
        invalid. Sufficient replicas exist for the pool to continue
        functioning in a degraded state.
action: Replace the device using 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-4J
 scrub: none requested
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      DEGRADED     0     0     0
          raidz1    DEGRADED     0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  FAULTED      0     0     0  corrupted data
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
try to offline it
[root@kg-f3] ~# zpool offline zstore ada1p2
[root@kg-f3] ~# zpool status zstore
  pool: zstore
 state: DEGRADED
status: One or more devices has been taken offline by the administrator.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Online the device using 'zpool online' or replace the device with
        'zpool replace'.
 scrub: none requested
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      DEGRADED     0     0     0
          raidz1    DEGRADED     0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  OFFLINE      0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok, overwrite the bad label. Unfortunately, no 'labelclear' command
[root@kg-f3] ~# zpool labelclear ada1p2
unrecognized command 'labelclear'
[..]
so I'll have to do it the hard way
[root@kg-f3] ~# gpart show ada1
=>        34  1953525101  ada1  GPT  (932G)
          34          94        - free -  (47K)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  1949330703     2  freebsd-zfs  (930G)
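A quick sanity check of the gpart numbers, as a Python sketch rather than anything that runs on the NAS. It assumes 512-byte sectors and takes the total sector count (1953525168) from the kernel probe message for ada1 further down in this entry; the names ENTRIES, check_contiguous and gib are made up for illustration.

```python
SECTOR = 512  # bytes; the kernel reports "512 byte sectors" for ada1

# (start, size, description) in sectors, from 'gpart show ada1'
ENTRIES = [
    (34, 94, "free"),
    (128, 4194304, "freebsd-swap"),
    (4194432, 1949330703, "freebsd-zfs"),
]
USABLE_START, USABLE_SIZE = 34, 1953525101  # the '=>' line
DISK_SECTORS = 1953525168                   # from the kernel probe message


def check_contiguous(entries, start, size):
    """True if the entries tile the usable area without gaps or overlaps."""
    pos = start
    for entry_start, entry_size, _ in entries:
        if entry_start != pos:
            return False
        pos = entry_start + entry_size
    return pos == start + size


def gib(sectors):
    """Size of a sector count in GiB."""
    return sectors * SECTOR / 2**30


print(check_contiguous(ENTRIES, USABLE_START, USABLE_SIZE))
for _, size, name in ENTRIES:
    print(name, round(gib(size), 2), "GiB")
# Whole disk in MiB, and the sectors left after the usable area
# (room for the backup GPT header and table).
print(DISK_SECTORS * SECTOR // 2**20, "MiB")
print(DISK_SECTORS - (USABLE_START + USABLE_SIZE), "sectors after the usable area")
```

The swap partition comes out at exactly 2 GiB and the whole disk at 953869 MiB, which matches what gpart and the kernel print.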
with dd
[root@kg-f3] ~# dd if=/dev/zero of=/dev/ada1 bs=1M count=1
dd: /dev/ada1: Operation not permitted
ok, set
[root@kg-f3] ~# sysctl kern.geom.debugflags=16
kern.geom.debugflags: 0 -> 16
[root@kg-f3] ~# sysctl kern.geom.debugflags
kern.geom.debugflags: 16
try again, hmm, oh I need swapoff also
[root@kg-f3] ~# swapoff /dev/ada1p1
[root@kg-f3] ~# dd if=/dev/zero of=/dev/ada1 bs=1M count=2
dd: /dev/ada1: Input/output error
1+0 records in
0+0 records out
0 bytes transferred in 0.880132 secs (0 bytes/sec)
check
[root@kg-f3] ~# gpart show ada1
gpart: No such geom: ada1.
ok, fix kern.geom.debugflags
[root@kg-f3] ~# sysctl kern.geom.debugflags
kern.geom.debugflags: 16
[root@kg-f3] ~# sysctl kern.geom.debugflags=0
kern.geom.debugflags: 16 -> 0
then retry the replacement from the gui - didn't work. Ok, from the command line then. Status
  pool: zstore
 state: DEGRADED
status: One or more devices has been taken offline by the administrator.
        Sufficient replicas exist for the pool to continue functioning in a
        degraded state.
action: Online the device using 'zpool online' or replace the device with
        'zpool replace'.
 scrub: none requested
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      DEGRADED     0     0     0
          raidz1    DEGRADED     0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  OFFLINE      0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
online
[root@kg-f3] ~# zpool online zstore ada1p2
warning: device 'ada1p2' onlined, but remains in faulted state
use 'zpool replace' to replace devices that are no longer present
status
[root@kg-f3] ~# zpool status
  pool: zstore
 state: DEGRADED
status: One or more devices could not be used because the label is missing or
        invalid. Sufficient replicas exist for the pool to continue
        functioning in a degraded state.
action: Replace the device using 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-4J
 scrub: none requested
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      DEGRADED     0     0     0
          raidz1    DEGRADED     0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  FAULTED      0     0     0  corrupted data
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
hmm, will replace work?
[root@kg-f3] ~# zpool replace zstore ada1p2
cannot replace ada1p2 with ada1p2: I/O error
hmm
[root@kg-f3] ~# tail /var/log/messages
Jun 23 17:13:51 kg-f3 kernel: (ada1:ahcich1:0:0:0): lost device
Jun 23 17:13:51 kg-f3 kernel: (ada1:ahcich1:0:0:0): removing device entry
Jun 23 17:14:03 kg-f3 kernel: ada1 at ahcich1 bus 0 scbus1 target 0 lun 0
Jun 23 17:14:03 kg-f3 kernel: ada1: <ST1000LM024 HN-M101MBB 2BA30001> ATA-8 SATA 3.x device
Jun 23 17:14:03 kg-f3 kernel: ada1: 600.000MB/s transfers (SATA 3.x, UDMA6, PIO 8192bytes)
Jun 23 17:14:03 kg-f3 kernel: ada1: Command Queueing enabled
Jun 23 17:14:03 kg-f3 kernel: ada1: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C)
Jun 23 17:16:58 kg-f3 root: ZFS: vdev failure, zpool=zstore type=vdev.corrupt_data
Jun 23 17:20:42 kg-f3 kernel: ahcich1: Timeout on slot 9 port 0
Jun 23 17:20:42 kg-f3 kernel: ahcich1: is 04000000 cs 0001c000 ss 0001fe00 rs 0001fe00 tfd c0 serr 00400001
and
[root@kg-f3] ~# smartctl -H /dev/ada1
smartctl 5.42 2011-10-20 r3458 [FreeBSD 8.2-RELEASE-p9 amd64] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net
=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED
if it is not the drive, the cables maybe?
2019-06-23: zpool status says
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
        attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: scrub completed after 6h57m with 0 errors on Sun Jun 16 06:57:24 2019
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       3 7.00K     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
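Spotting the odd row in that table by eye is easy to get wrong after the fiftieth "all zeros" scrub report, so here is a minimal Python sketch that picks out config rows that are not ONLINE or have non-zero READ/WRITE/CKSUM counters. It works on saved 'zpool status' text, not on the pool itself; the function name suspect_devices and the sample text are illustrative, and counters are compared as strings so a value like 7.00K is treated only as "not zero".

```python
SAMPLE = """\
  pool: zstore
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.
config:
NAME STATE READ WRITE CKSUM
zstore ONLINE 0 0 0
raidz1 ONLINE 0 0 0
ada0p2 ONLINE 0 0 0
ada1p2 ONLINE 3 7.00K 0
ada2p2 ONLINE 0 0 0
errors: No known data errors
"""


def suspect_devices(status_text):
    """Return {name: (state, read, write, cksum)} for rows needing attention."""
    suspects = {}
    in_config = False
    for line in status_text.splitlines():
        fields = line.split()
        if not fields:
            continue
        if fields[0] == "NAME":       # header row: the config table starts here
            in_config = True
            continue
        if fields[0] == "errors:":    # first line after the config table
            in_config = False
        if not in_config or len(fields) < 5:
            continue
        name, state, read, write, cksum = fields[:5]
        if state != "ONLINE" or (read, write, cksum) != ("0", "0", "0"):
            suspects[name] = (state, read, write, cksum)
    return suspects


print(suspect_devices(SAMPLE))
```

On a degraded pool the pool and raidz1 rows are reported too, since their state is DEGRADED.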
I offline the drive from the gui
tingo@kg-f3$ zpool status
  pool: zstore
 state: DEGRADED
status: One or more devices has experienced an unrecoverable error. An
        attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: scrub completed after 6h57m with 0 errors on Sun Jun 16 06:57:24 2019
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      DEGRADED     0     0     0
          raidz1    DEGRADED     0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  OFFLINE      3 7.00K     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
time to physically replace the drive. I replaced the drive physically, but it didn't show up automatically, so I rebooted the machine. A reboot didn't work, so I had to power off and on.
2019-06-22: hmm, looks like ada1 is having trouble. From /var/log/messages
Jun 22 23:12:44 kg-f3 kernel: ahcich1: Timeout on slot 14 port 0
Jun 22 23:12:44 kg-f3 kernel: ahcich1: is 00000000 cs 00ff0000 ss 00ffc000 rs 00ffc000 tfd 40 serr 00000000
Jun 22 23:13:15 kg-f3 kernel: ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jun 22 23:13:45 kg-f3 kernel: ahcich1: Timeout on slot 23 port 0
Jun 22 23:13:45 kg-f3 kernel: ahcich1: is 00000000 cs 00800000 ss 00000000 rs 00800000 tfd 80 serr 00000000
Jun 22 23:14:17 kg-f3 kernel: ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jun 22 23:14:47 kg-f3 kernel: ahcich1: Timeout on slot 23 port 0
Jun 22 23:14:47 kg-f3 kernel: ahcich1: is 00000000 cs 00800000 ss 00000000 rs 00800000 tfd 80 serr 00000000
Jun 22 23:14:47 kg-f3 kernel: (ada1:ahcich1:0:0:0): lost device
Jun 22 23:15:18 kg-f3 kernel: ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
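To see at a glance which AHCI channel is in trouble, the kernel messages can be tallied per channel. A minimal Python sketch, run on a copy of the log text; the event names and the function count_events are made up for illustration, and the patterns only cover the three message shapes seen in this excerpt.

```python
import re
from collections import Counter

# A few lines copied from the log excerpt above.
LOG = """\
Jun 22 23:12:44 kg-f3 kernel: ahcich1: Timeout on slot 14 port 0
Jun 22 23:13:15 kg-f3 kernel: ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
Jun 22 23:13:45 kg-f3 kernel: ahcich1: Timeout on slot 23 port 0
Jun 22 23:14:47 kg-f3 kernel: (ada1:ahcich1:0:0:0): lost device
"""

# One pattern per message shape; group 1 is always the channel name.
EVENTS = {
    "timeout": re.compile(r"kernel: (ahcich\d+): Timeout on slot \d+"),
    "reset_failed": re.compile(r"kernel: (ahcich\d+): AHCI reset: device not ready"),
    "lost_device": re.compile(r"kernel: \(ada\d+:(ahcich\d+):[\d:]+\): lost device"),
}


def count_events(text):
    """Count (channel, event kind) pairs found in the log text."""
    counts = Counter()
    for line in text.splitlines():
        for kind, pattern in EVENTS.items():
            match = pattern.search(line)
            if match:
                counts[(match.group(1), kind)] += 1
    return counts


for (channel, kind), n in sorted(count_events(LOG).items()):
    print(channel, kind, n)
```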
and
Jun 22 23:19:47 kg-f3 root: ZFS: vdev I/O failure, zpool=zstore path=/dev/ada1p2 offset=270336 size=8192 error=6
Jun 22 23:19:47 kg-f3 root: ZFS: vdev I/O failure, zpool=zstore path=/dev/ada1p2 offset=998056665088 size=8192 error=6
Jun 22 23:19:47 kg-f3 root: ZFS: vdev I/O failure, zpool=zstore path=/dev/ada1p2 offset=998056927232 size=8192 error=6
Jun 22 23:20:37 kg-f3 kernel: ahcich1: AHCI reset: device not ready after 31000ms (tfd = 00000080)
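The three failing offsets look arbitrary but are not. A Python sketch of the arithmetic, under these assumptions (my reading of the ZFS on-disk format, not something taken from this machine): every vdev carries four 256 KiB labels, two at the start and two at the end, and the end positions are computed from the device size rounded down to a multiple of 256 KiB. The partition size is taken from the 'gpart show ada1' output in the 2019-06-23 entry (freebsd-zfs, 1949330703 sectors of 512 bytes); the names below are illustrative.

```python
SECTOR = 512                 # bytes per sector
LABEL = 256 * 1024           # assumed size of one ZFS vdev label
PART_SECTORS = 1949330703    # freebsd-zfs partition size from 'gpart show ada1'

part_bytes = PART_SECTORS * SECTOR
usable = part_bytes - part_bytes % LABEL   # assumed: rounded down to a label multiple
# Assumed label positions: L0 and L1 at the front, L2 and L3 at the end.
LABEL_STARTS = [0, LABEL, usable - 2 * LABEL, usable - LABEL]


def locate(offset):
    """Return (label index, offset inside that label), or None if outside all labels."""
    for index, start in enumerate(LABEL_STARTS):
        if start <= offset < start + LABEL:
            return index, offset - start
    return None


for off in (270336, 998056665088, 998056927232):
    print(off, locate(off))
```

Under those assumptions all three I/Os land exactly 8 KiB into labels 1, 2 and 3. That would fit ZFS probing the labels of a device that has gone away (error=6 is ENXIO on FreeBSD) rather than failing reads of pool data.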
the device is missing
tingo@kg-f3$ ls -l /dev/ada*
crw-r-----  1 root  operator    0,  99 Mar 20 17:47 /dev/ada0
crw-r-----  1 root  operator    0, 111 Mar 20 17:47 /dev/ada0p1
crw-r-----  1 root  operator    0, 112 Mar 20 17:47 /dev/ada0p2
crw-r-----  1 root  operator    0, 101 Mar 20 17:47 /dev/ada2
crw-r-----  1 root  operator    0, 115 Mar 20 17:47 /dev/ada2p1
crw-r-----  1 root  operator    0, 116 Mar 20 17:47 /dev/ada2p2
crw-r-----  1 root  operator    0, 102 Mar 20 17:47 /dev/ada3
crw-r-----  1 root  operator    0, 117 Mar 20 17:47 /dev/ada3p1
crw-r-----  1 root  operator    0, 118 Mar 20 17:47 /dev/ada3p2
crw-r-----  1 root  operator    0, 103 Mar 20 17:47 /dev/ada4
crw-r-----  1 root  operator    0, 119 Mar 20 17:47 /dev/ada4p1
crw-r-----  1 root  operator    0, 120 Mar 20 17:47 /dev/ada4p2
crw-r-----  1 root  operator    0, 104 Mar 20 17:47 /dev/ada5
crw-r-----  1 root  operator    0, 121 Mar 20 17:47 /dev/ada5p1
crw-r-----  1 root  operator    0, 122 Mar 20 17:47 /dev/ada5p2
and the (blue) LED on that bay is off. Time to replace it, I think.
2019-06-16: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h57m with 0 errors on Sun Jun 16 06:57:24 2019
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok
2019-05-05: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h1m with 0 errors on Sun May 5 07:01:41 2019
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2019-03-24: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h0m with 0 errors on Sun Mar 24 07:00:32 2019
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2019-02-10: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h39m with 0 errors on Sun Feb 10 06:39:16 2019
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2018-12-30: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h40m with 0 errors on Sun Dec 30 06:40:25 2018
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2018-11-18: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h46m with 0 errors on Sun Nov 18 06:46:52 2018
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2018-10-14: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h44m with 0 errors on Sun Oct 14 06:44:39 2018
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2018-09-02: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h41m with 0 errors on Sun Sep 2 06:41:42 2018
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2018-07-22: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h43m with 0 errors on Sun Jul 22 06:43:25 2018
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok
2018-06-10: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h59m with 0 errors on Sun Jun 10 06:59:48 2018
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2018-04-29: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h8m with 0 errors on Sun Apr 29 07:08:30 2018
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2018-03-18: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h55m with 0 errors on Sun Mar 18 07:55:21 2018
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2018-02-04: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h20m with 0 errors on Sun Feb 4 07:20:05 2018
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2017-11-12: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h44m with 0 errors on Sun Nov 12 06:44:23 2017
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2017-10-08: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h56m with 0 errors on Sun Oct 8 06:56:59 2017
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2017-08-27: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h21m with 0 errors on Sun Aug 27 07:21:59 2017
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2017-07-16: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h47m with 0 errors on Sun Jul 16 06:47:22 2017
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2017-06-04: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h55m with 0 errors on Sun Jun 4 06:55:27 2017
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2017-04-23: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h15m with 0 errors on Sun Apr 23 07:16:02 2017
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2017-03-12: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h56m with 0 errors on Sun Mar 12 07:56:16 2017
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2017-01-29: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 8h27m with 0 errors on Sun Jan 29 08:27:10 2017
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2016-12-18: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h37m with 0 errors on Sun Dec 18 06:37:21 2016
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2016-11-06: latest scrub result
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h42m with 0 errors on Sun Nov 6 06:42:03 2016
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2016-10-02: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h39m with 0 errors on Sun Oct 2 06:39:26 2016
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2016-08-25: uptime status:
tingo@kg-f3$ date;uptime
Thu Aug 25 22:25:26 CEST 2016
10:25PM up 557 days, 3:15, 1 user, load averages: 0.00, 0.01, 0.00
ok.
2016-08-21: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h52m with 0 errors on Sun Aug 21 06:52:47 2016
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2016-07-10: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h53m with 0 errors on Sun Jul 10 06:53:07 2016
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2016-05-29: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h52m with 0 errors on Sun May 29 06:52:27 2016
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2016-04-17: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h49m with 0 errors on Sun Apr 17 06:49:53 2016
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2016-03-06: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h47m with 0 errors on Sun Mar 6 06:48:02 2016
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2016-01-24: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h1m with 0 errors on Sun Jan 24 07:02:00 2016
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
that's all.
2015-12-13: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h14m with 0 errors on Sun Dec 13 07:14:57 2015
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2015-11-14: change to new firewall noted in /var/log/messages:
Nov 14 13:48:26 kg-f3 kernel: arp: 10.1.10.1 moved from 00:10:4b:e3:38:8b to 80:ee:73:60:61:0c on re0
nothing more.
2015-11-01: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h9m with 0 errors on Sun Nov 1 07:09:51 2015
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2015-09-20: and this is the entry in /etc/crontab for scrub:
#minute hour    mday    month   wday    who     command
00      00      *       *       7       root    PATH="/bin:/sbin:/usr/bin:/usr/sbin:/usr/local/bin:/usr/local/sbin:/root/bin" /usr/local/sbin/scrub -t 35 zstore
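This crontab line also explains why the scrubs in this log are 42 days apart and not 35. Cron runs the script every Sunday at 00:00 (wday 7), and I assume '-t 35' means "skip unless the last scrub is at least 35 days old"; that reading of the flag is an assumption, not checked against the script. Because a scrub finishes a few hours after midnight, the Sunday five weeks later is still slightly under 35 days, so the run is skipped and the scrub happens a week after that. A small Python model of that logic, with the made-up name next_scrub:

```python
from datetime import datetime, timedelta


def next_scrub(last_scrub, threshold_days=35):
    """First Sunday 00:00 at least threshold_days after last_scrub.

    Models a weekly cron job (Sunday 00:00) that only scrubs when the
    previous scrub is at least threshold_days old (assumed meaning of -t).
    """
    midnight = last_scrub.replace(hour=0, minute=0, second=0, microsecond=0)
    # datetime.weekday(): Monday is 0, Sunday is 6.
    candidate = midnight + timedelta(days=(6 - midnight.weekday()) % 7)
    if candidate <= last_scrub:
        candidate += timedelta(days=7)
    while candidate - last_scrub < timedelta(days=threshold_days):
        candidate += timedelta(days=7)   # cron fires, but the scrub is skipped
    return candidate


# Scrub completion time taken from the 2019-06-16 entry.
print(next_scrub(datetime(2019, 6, 16, 6, 57, 24)))
```

For the 2019-06-16 scrub this gives 2019-07-28 00:00, which is the date of the next scrub result in the log.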
ok.
2015-09-20: here is the cron log entry for the scrub:
[root@kg-f3] ~# bzcat /var/log/cron.1.bz2 | grep scrub
Sep 20 00:00:00 kg-f3 /usr/sbin/cron[96447]: (root) CMD (PATH="/bin:/sbin:/usr/bin:/usr/sbin:/usr/local/bin:/usr/local/sbin:/root/bin" /usr/local/sbin/scrub -t 35 zstore)
2015-09-20: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h14m with 0 errors on Sun Sep 20 07:14:34 2015
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
ok.
2015-08-09: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 7h7m with 0 errors on Sun Aug 9 07:07:07 2015
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
nothing more.
2015-06-28: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h44m with 0 errors on Sun Jun 28 06:44:43 2015
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
all for now.
2015-05-17: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h40m with 0 errors on Sun May 17 06:40:10 2015
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
nothing more for now.
2015-04-05: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h27m with 0 errors on Sun Apr 5 06:27:10 2015
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
that's all now.
2015-02-22: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h7m with 0 errors on Sun Feb 22 06:07:08 2015
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
all for now.
2015-01-11: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h16m with 0 errors on Sun Jan 11 06:16:29 2015
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
nothing more.
2014-11-30: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h25m with 0 errors on Sun Nov 30 06:25:12 2014
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
all for now.
2014-10-26: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h29m with 0 errors on Sun Oct 26 05:29:11 2014
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
that's all.
2014-09-14: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h10m with 0 errors on Sun Sep 14 06:10:58 2014
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
nothing more.
2014-08-03: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h11m with 0 errors on Sun Aug 3 06:11:49 2014
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
nothing more.
2014-07-11: reboot due to a power failure (local circuit breaker tripped) earlier in the day. Nothing more.
2014-07-06: uptime has passed 400 days:
tingo@kg-f3$ date;uptime
Sun Jul 6 12:54:27 CEST 2014
12:54PM up 401 days, 5:25, 1 user, load averages: 0.04, 0.03, 0.00
that's all.
2014-06-22: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h17m with 0 errors on Sun Jun 22 06:17:15 2014
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
nothing more.
2014-05-11: The newest version of FreeNAS is 9.2.1.5, released on April 24th, 2014. This machine still runs FreeNAS-8.2.0-RELEASE-p1-x64 (r11950). Looking at the FreeNAS download site, versions 8.3.0, 8.3.1 and 8.3.2 are available.
2014-05-11: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h16m with 0 errors on Sun May 11 06:16:14 2014
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
Nothing more.
2013-07-28: latest scrub result:
tingo@kg-f3$ zpool status
  pool: zstore
 state: ONLINE
 scrub: scrub completed after 6h44m with 0 errors on Sun Jul 28 06:44:19 2013
config:
        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0     0
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0
errors: No known data errors
That's it.
2013-03-24: checking cron log files for scrub command:
tingo@kg-f3$ ll /var/log/cro*
-rw-------  1 root  wheel  - 79155 Mar 24 13:06 /var/log/cron
-rw-------  1 root  wheel  -  3894 Mar 24 07:00 /var/log/cron.0.bz2
-rw-------  1 root  wheel  -  3561 Mar 23 23:00 /var/log/cron.1.bz2
-rw-------  1 root  wheel  -  3559 Mar 23 15:00 /var/log/cron.2.bz2
-rw-------  1 root  wheel  -  3783 Mar 23 07:00 /var/log/cron.3.bz2
tingo@kg-f3$ su -
Password:
[root@kg-f3] ~# grep scrub /var/log/cron
[root@kg-f3] ~# bzcat /var/log/cron.0.bz2 | grep scrub
Mar 24 00:00:00 kg-f3 /usr/sbin/cron[4259]: (root) CMD (PATH="/bin:/sbin:/usr/bin:/usr/sbin:/usr/local/bin:/usr/local/sbin:/root/bin" /usr/local/sbin/scrub -t 35 zstore)
Good. And scrub works, too:
tingo@kg-f3$ zpool status pool: zstore state: ONLINE scrub: scrub completed after 6h7m with 0 errors on Sun Mar 24 06:07:38 2013 config: NAME STATE READ WRITE CKSUM zstore ONLINE 0 0 0 raidz1 ONLINE 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 ada3p2 ONLINE 0 0 0 ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors
Nothing more.
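The cron log check above (live log first, then the bzip2-rotated copies) can be wrapped in a small helper. This is only a sketch: the function name find_scrub_jobs is made up for illustration, and it assumes the log layout shown in the listing (cron plus cron.N.bz2 in one directory).

```shell
#!/bin/sh
# find_scrub_jobs [DIR]: search the live cron log and any bzip2-rotated
# copies in DIR (default /var/log) for lines mentioning "scrub".
# Hypothetical helper name; each hit is prefixed with the file it came from.
find_scrub_jobs() {
    logdir=${1:-/var/log}
    for f in "$logdir"/cron "$logdir"/cron.*.bz2; do
        # Skip unmatched glob patterns and missing files.
        [ -f "$f" ] || continue
        case "$f" in
            *.bz2) bzcat "$f" ;;
            *)     cat "$f" ;;
        esac | grep scrub | sed "s|^|$(basename "$f"): |"
    done
    return 0
}
```

Run as root (the logs are mode 0600), for example: find_scrub_jobs /var/log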
2012-12-30: the scheduled zfs scrub ran, no problems:
tingo@kg-f3$ zpool status pool: zstore state: ONLINE scrub: scrub completed after 6h18m with 0 errors on Sun Dec 30 06:18:09 2012 config: NAME STATE READ WRITE CKSUM zstore ONLINE 0 0 0 raidz1 ONLINE 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 ada3p2 ONLINE 0 0 0 ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors
That's all.
2012-12-15: running a disk test. Prepare:
[root@kg-f3] ~# cd /mnt/zstore/home-tingo
[root@kg-f3] /mnt/zstore/home-tingo# mkdir test
[root@kg-f3] /mnt/zstore/home-tingo# cd test
The relevant parameters:
[root@kg-f3] /mnt/zstore/home-tingo/test# zdb -C zstore | grep ashift
ashift=9
[root@kg-f3] /mnt/zstore/home-tingo/test# sysctl hw.physmem
hw.physmem: 8153223168
[root@kg-f3] /mnt/zstore/home-tingo/test# sysctl kstat.zfs.misc.arcstats.c
kstat.zfs.misc.arcstats.c: 5846991325
the test
Command line used: iozone -r 4k -r 8k -r 16k -r 32k -r 64k -r 128k -s 6g -i 0 -i 1 -i 2
Output is in Kbytes/sec
Time Resolution = 0.000001 seconds.
Processor cache size set to 1024 Kbytes.
Processor cache line size set to 32 bytes.
File stride size set to 17 * record size.
                                                         random   random     bkwd   record   stride
       KB   reclen    write  rewrite     read   reread     read    write     read  rewrite     read   fwrite frewrite    fread  freread
  6291456        4   104553    46250   175951   149704      918      639
  6291456        8   126172    47481   169062   120174     1899     1294
  6291456       16   129753    47980   204112   132605     3604     2528
  6291456       32   158847    53023   229718   188100     6081     4600
  6291456       64   198143    61299   194177   170953     9644    10262
  6291456      128   225659   205506   156392   162202    14490   243319
iozone test complete.
Some output removed for brevity. The test took a really long time.
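For reference, the sizes involved can be compared with a little shell arithmetic. This is a sketch using the numbers from the sysctl output and the iozone command line above; the helper name to_mib is made up here. It shows that the 6g test file is larger than the ARC target size (roughly 5.4 GiB), so the reads presumably could not all be served from cache.

```shell
#!/bin/sh
# Values copied from the sysctl output and the iozone run in this entry.
physmem=8153223168      # hw.physmem, bytes
arc_target=5846991325   # kstat.zfs.misc.arcstats.c, bytes
file_kb=6291456         # iozone -s 6g, as reported in the KB column

# Hypothetical helper: convert bytes to whole MiB (integer division).
to_mib() { echo $(( $1 / 1048576 )); }

file_bytes=$(( file_kb * 1024 ))
echo "physmem:    $(to_mib "$physmem") MiB"
echo "ARC target: $(to_mib "$arc_target") MiB"
echo "test file:  $(to_mib "$file_bytes") MiB"
if [ "$file_bytes" -gt "$arc_target" ]; then
    echo "test file is larger than the ARC target"
fi
```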
2012-09-22: I ran a zpool scrub, which passed without problems:
tingo@kg-f3$ zpool status pool: zstore state: ONLINE scrub: scrub completed after 5h33m with 0 errors on Sat Sep 22 17:36:59 2012 config: NAME STATE READ WRITE CKSUM zstore ONLINE 0 0 0 raidz1 ONLINE 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 ada3p2 ONLINE 0 0 0 ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors
Good.
2012-09-22: After upgrade to FreeNAS 8.2.0-RELEASE-p1-x64, the web gui reports:
FreeNAS-8.2.0-RELEASE-p1-x64 (r11950)
(the GUI doesn't report OS info anymore) From the command line:
tingo@kg-f3$ uname -a
FreeBSD kg-f3.kg4.no 8.2-RELEASE-p9 FreeBSD 8.2-RELEASE-p9 #0: Thu Jul 19 12:39:10 PDT 2012 root@build.ixsystems.com:/build/home/jpaetzel/8.2.0/os-base/amd64/build/home/jpaetzel/8.2.0/FreeBSD/src/sys/FREENAS.amd64 amd64
tingo@kg-f3$ df -h
Filesystem             Size    Used   Avail Capacity  Mounted on
/dev/ufs/FreeNASs2a    927M    358M    495M    42%    /
devfs                  1.0K    1.0K      0B   100%    /dev
/dev/md0               4.6M    1.8M    2.4M    43%    /etc
/dev/md1               824K    2.0K    756K     0%    /mnt
/dev/md2               149M    7.2M    130M     5%    /var
/dev/ufs/FreeNASs4      20M    1.1M     17M     6%    /data
zstore                 1.2T     42K    1.2T     0%    /mnt/zstore
zstore/home-tingo      1.4T    202G    1.2T    14%    /mnt/zstore/home-tingo
zstore/media           1.3T     79G    1.2T     6%    /mnt/zstore/media
zstore/xxx             4.2T    3.0T    1.2T    72%    /mnt/zstore/xxx
dmesg output: normal. And now swap is back on all partitions too (previously, ada3p1 was missing):
tingo@kg-f3$ swapinfo -h Device 1K-blocks Used Avail Capacity /dev/ada0p1 2097152 0B 2.0G 0% /dev/ada1p1 2097152 0B 2.0G 0% /dev/ada2p1 2097152 0B 2.0G 0% /dev/ada3p1 2097152 0B 2.0G 0% /dev/ada4p1 2097152 0B 2.0G 0% /dev/ada5p1 2097152 0B 2.0G 0% Total 12582912 0B 12G 0%
That's all.
2012-09-22: I upgraded to FreeNAS 8.2.0 via the GUI upgrade method. After the upgrade, the machine got the wrong ip address again (I had a console attached, so it was easy to spot). To fix it, I ssh'ed into the machine:
tingo@kg-v2$ ssh tingo@10.1.161.16 tingo@10.1.161.16's password: Last login: Sat Sep 22 11:14:21 2012 from 10.1.150.23 Copyright (c) 1980, 1983, 1986, 1988, 1990, 1991, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 8.2-RELEASE-p9 (FREENAS.amd64) #0: Thu Jul 19 12:39:10 PDT 2012 FreeNAS (c) 2009-2012, The FreeNAS Development Team All rights reserved. FreeNAS is released under the modified BSD license. For more information, documentation, help or support, go here: http://freenas.org Welcome to FreeNAS
and did this to fix it:
tingo@kg-f3$ su -
Password:
[root@kg-f3] ~#
[root@kg-f3] ~# mount -w /
[root@kg-f3] ~# vi /conf/base/etc/dhclient.conf
and adding the line
send dhcp-client-identifier "kg-f3.kg4.no";
to the file before saving it.
[root@kg-f3] ~# mount -r /
Then I rebooted the machine (via the console menu this time). It came back up with the correct ip address.
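Since this fix had to be redone after the upgrade, the vi step could also be scripted so that it is safe to repeat. A minimal sketch, assuming the same file and line as above; ensure_line is a made-up helper name, not something FreeNAS provides.

```shell
#!/bin/sh
# ensure_line FILE LINE: append LINE to FILE unless an identical line is
# already present. Hypothetical helper; grep -F -x matches the whole line
# literally, so running it twice does not duplicate the entry.
ensure_line() {
    file=$1
    line=$2
    grep -F -x -q -- "$line" "$file" 2>/dev/null || printf '%s\n' "$line" >> "$file"
}

# Intended use on this machine (as root, not executed here):
#   mount -w /
#   ensure_line /conf/base/etc/dhclient.conf 'send dhcp-client-identifier "kg-f3.kg4.no";'
#   mount -r /
```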
2012-09-21: after bios upgrade to version 0902, here is dmesg output: normal.
2012-09-21: this week I have copied a large amount of data (1 TB +) to the machine. It worked well, even though I see these messages in /var/log/messages:
Sep 18 01:37:55 kg-f3 kernel: ahcich3: Timeout on slot 19 port 0
Sep 18 01:37:55 kg-f3 kernel: ahcich3: is 00000000 cs 1ff00000 ss 1ff80000 rs 1ff80000 tfd 40 serr 00000000
Sep 18 01:37:55 kg-f3 kernel: ahcich0: Timeout on slot 0 port 0
Sep 18 01:37:55 kg-f3 kernel: ahcich0: is 00000000 cs 000003fc ss 000003ff rs 000003ff tfd 40 serr 00000000
Sep 19 11:28:16 kg-f3 kernel: ahcich1: Timeout on slot 24 port 0
Sep 19 11:28:16 kg-f3 kernel: ahcich1: is 00000000 cs e0000003 ss ff000003 rs ff000003 tfd 40 serr 00000000
Sep 20 12:45:41 kg-f3 kernel: ahcich3: Timeout on slot 1 port 0
Sep 20 12:45:41 kg-f3 kernel: ahcich3: is 00000000 cs 000007fc ss 000007fe rs 000007fe tfd 40 serr 00000000
the pool still looks ok:
tingo@kg-f3$ zpool status pool: zstore state: ONLINE scrub: none requested config: NAME STATE READ WRITE CKSUM zstore ONLINE 0 0 0 raidz1 ONLINE 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 ada3p2 ONLINE 0 0 0 ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors
That's it.
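To see whether the AHCI timeouts cluster on one channel, the messages can be counted per channel. A rough sketch, assuming the syslog line format shown in this entry (channel name in the sixth field); count_ahci_timeouts is a made-up name.

```shell
#!/bin/sh
# count_ahci_timeouts: read syslog lines on stdin and print
# "<channel> <count>" for every AHCI channel that logged a timeout.
# Assumes the "Mon DD HH:MM:SS host kernel: ahcichN: Timeout ..." layout.
count_ahci_timeouts() {
    awk '/ahcich[0-9]+: Timeout on slot/ {
             ch = $6; sub(/:$/, "", ch); n[ch]++
         }
         END { for (ch in n) print ch, n[ch] }' | sort
}

# Usage (as root): count_ahci_timeouts < /var/log/messages
```

On this machine ahcich3 is the channel ada3 attaches to, so a high count there would point at the same bay that caused trouble in May.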
2012-05-21: Yes, the zpool scrub finished without errors. After that I ran zpool clear to clear the remaining errors. Now the status is:
[root@kg-f3] ~# zpool status pool: zstore state: ONLINE scrub: resilver completed after 2h32m with 0 errors on Mon May 21 01:15:45 2012 config: NAME STATE READ WRITE CKSUM zstore ONLINE 0 0 0 raidz1 ONLINE 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 ada3p2 ONLINE 0 0 0 344G resilvered ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors
and I have copied lots of files to the machine. It seems to work. From the log:
[root@kg-f3] ~# tail /var/log/messages
May 20 22:43:38 kg-f3 root: ZFS: checksum mismatch, zpool=zstore path=/dev/ada3p2 offset=70771892224 size=1024
May 20 22:43:38 kg-f3 root: ZFS: checksum mismatch, zpool=zstore path=/dev/ada3p2 offset=68958706688 size=512
May 20 22:43:38 kg-f3 root: ZFS: checksum mismatch, zpool=zstore path=/dev/ada3p2 offset=70771893760 size=1024
May 20 22:43:38 kg-f3 root: ZFS: checksum mismatch, zpool=zstore path=/dev/ada3p2 offset=70771894784 size=1536
May 20 22:43:38 kg-f3 root: ZFS: checksum mismatch, zpool=zstore path=/dev/ada3p2 offset=68946128896 size=512
May 20 22:53:04 kg-f3 ntpd[1528]: kernel time sync status change 2001
The checksum errors were logged before I ran the zpool clear.
2012-05-20: after trying to fix the backplane of the hot swap drive tray, I fire up FreeNAS again. Some status first:
[root@kg-f3] ~# camcontrol devlist
<SAMSUNG HN-M101MBB 2AR10001>      at scbus0 target 0 lun 0 (ada0,pass0)
<SAMSUNG HN-M101MBB 2AR10001>      at scbus1 target 0 lun 0 (ada1,pass1)
<SAMSUNG HN-M101MBB 2AR10001>      at scbus2 target 0 lun 0 (ada2,pass2)
<SAMSUNG HN-M101MBB 2AR10001>      at scbus3 target 0 lun 0 (ada3,pass3)
<SAMSUNG HN-M101MBB 2AR10001>      at scbus4 target 0 lun 0 (ada4,pass4)
<SAMSUNG HN-M101MBB 2AR10001>      at scbus4 target 1 lun 0 (ada5,pass5)
<Verbatim STORE N GO 3.00>         at scbus6 target 0 lun 0 (da0,pass6)
[root@kg-f3] ~# zpool status
  pool: zstore
 state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
        attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
        using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: resilver in progress for 0h4m, 4.08% done, 1h38m to go
config:

        NAME        STATE     READ WRITE CKSUM
        zstore      ONLINE       0     0     0
          raidz1    ONLINE       0     0     0
            ada0p2  ONLINE       0     0     0
            ada1p2  ONLINE       0     0     0
            ada2p2  ONLINE       0     0     0
            ada3p2  ONLINE       0     0    10  9.82G resilvered
            ada4p2  ONLINE       0     0     0
            ada5p2  ONLINE       0     0     0

errors: No known data errors
Then I try a scrub of the pool:
[root@kg-f3] ~# zpool scrub zstore
We shall see if it finishes without errors.
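When checking status output like the one above repeatedly, it can help to print only the devices whose error counters are not zero. A minimal sketch, assuming the plain integer READ/WRITE/CKSUM columns seen in this log; pool_error_devices is a made-up name.

```shell
#!/bin/sh
# pool_error_devices: read `zpool status` output on stdin and print
# "<name> <read> <write> <cksum>" for every config row with a non-zero
# counter. Assumes plain integer counters in columns 3 to 5.
pool_error_devices() {
    awk '$1 == "NAME" && $2 == "STATE" { in_cfg = 1; next }
         $1 == "errors:"              { in_cfg = 0 }
         in_cfg && NF >= 5 && ($3 + $4 + $5) > 0 { print $1, $3, $4, $5 }'
}

# Usage: zpool status zstore | pool_error_devices
```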
2012-05-15: unfortunately, replacing the ada3 drive didn't help. Perhaps something bad has happened to the drive bay? I must disassemble it and do a visual inspection.
2012-05-15: let's try to replace the ada3 drive by hotswapping it (using the guide in the FreeNAS documentation): first, turn off swapping for that drive:
[root@kg-f3] ~# swapoff /dev/ada3p1
verify:
[root@kg-f3] ~# swapinfo -h Device 1K-blocks Used Avail Capacity /dev/ada0p1 2097152 0B 2.0G 0% /dev/ada1p1 2097152 0B 2.0G 0% /dev/ada2p1 2097152 0B 2.0G 0% /dev/ada4p1 2097152 0B 2.0G 0% /dev/ada5p1 2097152 0B 2.0G 0% Total 10485760 0B 10G 0%
Next offline it from the pool:
[root@kg-f3] ~# zpool offline zstore ada3p2
verify:
[root@kg-f3] ~# zpool status pool: zstore state: DEGRADED status: One or more devices has been taken offline by the administrator. Sufficient replicas exist for the pool to continue functioning in a degraded state. action: Online the device using 'zpool online' or replace the device with 'zpool replace'. scrub: none requested config: NAME STATE READ WRITE CKSUM zstore DEGRADED 0 0 0 raidz1 DEGRADED 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 ada3p2 OFFLINE 0 0 0 ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors
Next, physically replace the drive. I pulled out the SATA6 tray, and got this in /var/log/messages:
May 15 23:00:39 kg-f3 kernel: (ada3:ahcich3:0:0:0): lost device
May 15 23:00:39 kg-f3 kernel: (ada3:ahcich3:0:0:0): removing device entry
I unscrewed the old drive from the tray, and screwed the new drive in, then put the tray back into its slot. I got this in /var/log/messages:
May 15 23:07:05 kg-f3 kernel: ada3 at ahcich3 bus 0 scbus3 target 0 lun 0
May 15 23:07:05 kg-f3 kernel: ada3: <SAMSUNG HN-M101MBB 2AR10001> ATA-8 SATA 2.x device
May 15 23:07:05 kg-f3 kernel: ada3: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes)
May 15 23:07:05 kg-f3 kernel: ada3: Command Queueing enabled
May 15 23:07:05 kg-f3 kernel: ada3: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C)
Cool, I'm ready to configure the new drive now.
[root@kg-f3] ~# gpart create -s gpt ada3 ada3 created
create a swap partition:
[root@kg-f3] ~# gpart add -b 128 -s 4194304 -t freebsd-swap ada3 ada3p1 added
create the zfs partition:
[root@kg-f3] ~# gpart add -t freebsd-zfs ada3 ada3p2 added
verify
[root@kg-f3] ~# gpart show ada3
=>        34  1953525101  ada3  GPT  (932G)
          34          94        - free -  (47K)
         128     4194304     1  freebsd-swap  (2.0G)
     4194432  1949330703     2  freebsd-zfs  (930G)
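The numbers in the gpart output can be cross-checked with shell arithmetic. This sketch only reuses values from the commands and output above (512-byte sectors, start 128, size 4194304) and recomputes where the zfs partition should start and how many sectors are left for it.

```shell
#!/bin/sh
# Values taken from the gpart commands and the "gpart show ada3" output.
sector=512               # bytes per sector
first_usable=34          # first usable sector, from the "=>" line
usable=1953525101        # number of usable sectors, from the "=>" line
swap_start=128           # gpart add -b 128
swap_size=4194304        # gpart add -s 4194304 (sectors)

swap_bytes=$(( swap_size * sector ))
zfs_start=$(( swap_start + swap_size ))
zfs_size=$(( first_usable + usable - zfs_start ))

echo "swap:      $(( swap_bytes / 1024 / 1024 )) MiB"
echo "zfs start: sector $zfs_start"
echo "zfs size:  $zfs_size sectors"
```

The computed start and size should agree with the freebsd-zfs row that gpart show reports.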
Next, connect up the new drive: enable swap again:
[root@kg-f3] ~# service swap1 start
and verify:
[root@kg-f3] ~# swapinfo -h Device 1K-blocks Used Avail Capacity /dev/ada0p1 2097152 0B 2.0G 0% /dev/ada1p1 2097152 0B 2.0G 0% /dev/ada2p1 2097152 0B 2.0G 0% /dev/ada4p1 2097152 0B 2.0G 0% /dev/ada5p1 2097152 0B 2.0G 0% /dev/ada3p1 2097152 0B 2.0G 0% Total 12582912 0B 12G 0%
Replace the drive in the zpool:
[root@kg-f3] ~# zpool replace zstore ada3p2
and verify:
[root@kg-f3] ~# zpool status pool: zstore state: DEGRADED status: One or more devices is currently being resilvered. The pool will continue to function, possibly in a degraded state. action: Wait for the resilver to complete. scrub: resilver in progress for 0h2m, 0.01% done, 542h2m to go config: NAME STATE READ WRITE CKSUM zstore DEGRADED 0 0 0 raidz1 DEGRADED 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 replacing DEGRADED 0 0 0 ada3p2/old OFFLINE 0 0 0 ada3p2 ONLINE 0 0 0 20.1M resilvered ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors
Ok, detach the old drive, it's of no use now:
[root@kg-f3] ~# zpool detach zstore ada3p2/old
and then I get device timeouts on ahcich3 again, and lose ada3 again. Darn - not good. Perhaps the drive tray has power problems. Ok, I disassembled the machine - one of the power cables to the hot swap drive bay was loose. I re-seated it and re-assembled the machine, and it is back up again:
[root@kg-f3] ~# zpool status pool: zstore state: DEGRADED status: One or more devices is currently being resilvered. The pool will continue to function, possibly in a degraded state. action: Wait for the resilver to complete. scrub: resilver in progress for 0h7m, 0.15% done, 77h48m to go config: NAME STATE READ WRITE CKSUM zstore DEGRADED 0 0 0 raidz1 DEGRADED 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 replacing DEGRADED 0 0 2 ada3p2/old OFFLINE 0 0 0 ada3p2 ONLINE 0 0 0 546M resilvered ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors
Good. Now retry the detach command:
[root@kg-f3] ~# zpool detach zstore ada3p2/old
No errors so far. Is everything ok?
[root@kg-f3] ~# zpool status pool: zstore state: ONLINE status: One or more devices is currently being resilvered. The pool will continue to function, possibly in a degraded state. action: Wait for the resilver to complete. scrub: resilver in progress for 0h1m, 0.22% done, 13h45m to go config: NAME STATE READ WRITE CKSUM zstore ONLINE 0 0 0 raidz1 ONLINE 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 ada3p2 ONLINE 0 0 0 797M resilvered ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors
Yes. Tomorrow I will see if the resilver went all right.
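For next time, the command sequence used in this entry can be printed as a checklist for a given pool and disk. This is a dry-run sketch only: print_replace_plan is a made-up name, it prints the commands rather than running them, and it assumes the same partition layout as here (p1 swap of 4194304 sectors at offset 128, p2 zfs).

```shell
#!/bin/sh
# print_replace_plan POOL DISK: print (do not run) the hot-swap sequence
# from this entry. Hypothetical helper; partition sizes are the ones used
# on this machine and would need adjusting elsewhere.
print_replace_plan() {
    pool=$1
    disk=$2
    cat <<EOF
swapoff /dev/${disk}p1
zpool offline ${pool} ${disk}p2
# physically swap the drive, wait for it to attach, then:
gpart create -s gpt ${disk}
gpart add -b 128 -s 4194304 -t freebsd-swap ${disk}
gpart add -t freebsd-zfs ${disk}
service swap1 start
zpool replace ${pool} ${disk}p2
zpool status ${pool}
EOF
}

print_replace_plan zstore ada3
```

Printing instead of executing keeps the physical drive swap as a manual step and lets each command be checked before it is typed.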
2012-05-15: current disk setup:
[root@kg-f3] ~# swapinfo -h Device 1K-blocks Used Avail Capacity /dev/ada0p1 2097152 0B 2.0G 0% /dev/ada1p1 2097152 0B 2.0G 0% /dev/ada2p1 2097152 0B 2.0G 0% /dev/ada3p1 2097152 0B 2.0G 0% /dev/ada4p1 2097152 0B 2.0G 0% /dev/ada5p1 2097152 0B 2.0G 0% Total 12582912 0B 12G 0% [root@kg-f3] ~# zpool status pool: zstore state: ONLINE status: One or more devices has experienced an unrecoverable error. An attempt was made to correct the error. Applications are unaffected. action: Determine if the device needs to be replaced, and clear the errors using 'zpool clear' or replace the device with 'zpool replace'. see: http://www.sun.com/msg/ZFS-8000-9P scrub: none requested config: NAME STATE READ WRITE CKSUM zstore ONLINE 0 0 0 raidz1 ONLINE 0 0 0 ada0p2 ONLINE 0 0 0 ada1p2 ONLINE 0 0 0 ada2p2 ONLINE 0 0 0 ada3p2 ONLINE 0 0 2 ada4p2 ONLINE 0 0 0 ada5p2 ONLINE 0 0 0 errors: No known data errors [root@kg-f3] ~# gpart show ada0 ada1 ada2 ada3 ada4 ada5 => 34 1953525101 ada0 GPT (932G) 34 94 - free - (47K) 128 4194304 1 freebsd-swap (2.0G) 4194432 1949330703 2 freebsd-zfs (930G) => 34 1953525101 ada1 GPT (932G) 34 94 - free - (47K) 128 4194304 1 freebsd-swap (2.0G) 4194432 1949330703 2 freebsd-zfs (930G) => 34 1953525101 ada2 GPT (932G) 34 94 - free - (47K) 128 4194304 1 freebsd-swap (2.0G) 4194432 1949330703 2 freebsd-zfs (930G) => 34 1953525101 ada3 GPT (932G) 34 94 - free - (47K) 128 4194304 1 freebsd-swap (2.0G) 4194432 1949330703 2 freebsd-zfs (930G) => 34 1953525101 ada4 GPT (932G) 34 94 - free - (47K) 128 4194304 1 freebsd-swap (2.0G) 4194432 1949330703 2 freebsd-zfs (930G) => 34 1953525101 ada5 GPT (932G) 34 94 - free - (47K) 128 4194304 1 freebsd-swap (2.0G) 4194432 1949330703 2 freebsd-zfs (930G)
That is all.
2012-05-14: checking the logs after testing the ada0 drive:
[root@kg-f3] ~# smartctl -l selftest /dev/ada0 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed without error 00% 3172 - [root@kg-f3] ~# smartctl -l error /dev/ada0 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART Error Log Version: 1 No Errors Logged
This one is good.
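The same check can be scripted for the remaining drives. A small sketch, assuming the self-test log format shown above, where the most recent test is the row numbered "# 1"; latest_selftest_ok is a made-up name.

```shell
#!/bin/sh
# latest_selftest_ok: read `smartctl -l selftest` output on stdin and
# succeed only if the most recent entry ("# 1") completed without error.
latest_selftest_ok() {
    awk '/^# *1 / { found = 1; ok = ($0 ~ /Completed without error/); exit }
         END { exit !(found && ok) }'
}

# Usage (as root):
#   smartctl -l selftest /dev/ada0 | latest_selftest_ok && echo "ada0 ok"
```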
2012-05-13: testing the ada0 drive:
[root@kg-f3] ~# smartctl -t long /dev/ada0 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION === Sending command: "Execute SMART Extended self-test routine immediately in off-line mode". Drive command "Execute SMART Extended self-test routine immediately in off-line mode" successful. Testing has begun. Please wait 218 minutes for test to complete. Test will complete after Mon May 14 01:42:25 2012 Use smartctl -X to abort test.
2012-05-12: after a reboot, I'm testing the hard drives with smartctl: First smartctl -H:
[root@kg-f3] ~# smartctl -H /dev/ada0 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED [root@kg-f3] ~# smartctl -H /dev/ada1 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED [root@kg-f3] ~# smartctl -H /dev/ada2 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED [root@kg-f3] ~# smartctl -H /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED [root@kg-f3] ~# smartctl -H /dev/ada4 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED [root@kg-f3] ~# smartctl -H /dev/ada5 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED
Nothing exciting there. Next smartctl -a, doing ada3 first:
[root@kg-f3] ~# smartctl -a /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: SAMSUNG HN-M101MBB Serial Number: S2R8J9HB911149 LU WWN Device Id: 5 0024e9 2061ba6c7 Firmware Version: 2AR10001 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat May 12 18:04:01 2012 CEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (13020) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 217) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. 
SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 100 100 051 Pre-fail Always - 13 2 Throughput_Performance 0x0026 252 252 000 Old_age Always - 0 3 Spin_Up_Time 0x0023 089 089 025 Pre-fail Always - 3465 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 28 5 Reallocated_Sector_Ct 0x0033 252 252 010 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 252 252 051 Old_age Always - 0 8 Seek_Time_Performance 0x0024 252 252 015 Old_age Offline - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 3138 10 Spin_Retry_Count 0x0032 252 252 051 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 2 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 28 181 Program_Fail_Cnt_Total 0x0022 100 100 000 Old_age Always - 4347657 191 G-Sense_Error_Rate 0x0022 252 252 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0022 252 252 000 Old_age Always - 0 194 Temperature_Celsius 0x0002 064 061 000 Old_age Always - 33 (Min/Max 19/39) 195 Hardware_ECC_Recovered 0x003a 100 100 000 Old_age Always - 0 196 Reallocated_Event_Count 0x0032 252 252 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 252 252 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 252 252 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0036 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x002a 100 100 000 Old_age Always - 576 223 Load_Retry_Count 0x0032 100 100 000 Old_age Always - 2 225 Load_Cycle_Count 0x0032 098 098 000 Old_age Always - 29842 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. 
[To run self-tests, use: smartctl -t] Note: selective self-test log revision number (0) not 1 implies that no selective self-test has ever been run SMART Selective self-test log data structure revision number 0 Note: revision number not 1 implies that no selective self-test has ever been run SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Completed [00% left] (0-65535) 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
then ada0:
[root@kg-f3] ~# smartctl -a /dev/ada0 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: SAMSUNG HN-M101MBB Serial Number: S2R8J9HB911201 LU WWN Device Id: 5 0024e9 2061ba9c0 Firmware Version: 2AR10001 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat May 12 18:06:11 2012 CEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (13080) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 218) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. 
SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 100 100 051 Pre-fail Always - 2 2 Throughput_Performance 0x0026 252 252 000 Old_age Always - 0 3 Spin_Up_Time 0x0023 089 089 025 Pre-fail Always - 3470 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 28 5 Reallocated_Sector_Ct 0x0033 252 252 010 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 252 252 051 Old_age Always - 0 8 Seek_Time_Performance 0x0024 252 252 015 Old_age Offline - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 3140 10 Spin_Retry_Count 0x0032 252 252 051 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 31 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 28 181 Program_Fail_Cnt_Total 0x0022 100 100 000 Old_age Always - 4311907 191 G-Sense_Error_Rate 0x0022 252 252 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0022 252 252 000 Old_age Always - 0 194 Temperature_Celsius 0x0002 064 062 000 Old_age Always - 32 (Min/Max 19/38) 195 Hardware_ECC_Recovered 0x003a 100 100 000 Old_age Always - 0 196 Reallocated_Event_Count 0x0032 252 252 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 252 252 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 252 252 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0036 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x002a 100 100 000 Old_age Always - 281 223 Load_Retry_Count 0x0032 100 100 000 Old_age Always - 31 225 Load_Cycle_Count 0x0032 098 098 000 Old_age Always - 30091 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. 
[To run self-tests, use: smartctl -t] Note: selective self-test log revision number (0) not 1 implies that no selective self-test has ever been run SMART Selective self-test log data structure revision number 0 Note: revision number not 1 implies that no selective self-test has ever been run SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Completed [00% left] (0-65535) 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
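Comparing drives is easier when only the raw value of one attribute is pulled out of each report. A minimal sketch, assuming the attribute table layout shown above (name in the second column, raw value in the tenth); smart_raw is a made-up name.

```shell
#!/bin/sh
# smart_raw NAME: read `smartctl -a` output on stdin and print the
# RAW_VALUE column for the attribute called NAME.
smart_raw() {
    awk -v want="$1" '$2 == want { print $10; exit }'
}

# Usage (as root), one line per drive:
#   for d in ada0 ada1 ada2 ada3 ada4 ada5; do
#       echo "$d $(smartctl -a /dev/$d | smart_raw Raw_Read_Error_Rate)"
#   done
```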
followed by ada1:
[root@kg-f3] ~# smartctl -a /dev/ada1 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: SAMSUNG HN-M101MBB Serial Number: S2R8J9HB911056 LU WWN Device Id: 5 0024e9 2061ba00d Firmware Version: 2AR10001 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat May 12 18:07:47 2012 CEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (13320) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 222) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. 
SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 100 100 051 Pre-fail Always - 2 2 Throughput_Performance 0x0026 252 252 000 Old_age Always - 0 3 Spin_Up_Time 0x0023 089 089 025 Pre-fail Always - 3457 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 28 5 Reallocated_Sector_Ct 0x0033 252 252 010 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 252 252 051 Old_age Always - 0 8 Seek_Time_Performance 0x0024 252 252 015 Old_age Offline - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 3140 10 Spin_Retry_Count 0x0032 252 252 051 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 13 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 28 181 Program_Fail_Cnt_Total 0x0022 100 100 000 Old_age Always - 4381617 191 G-Sense_Error_Rate 0x0022 252 252 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0022 252 252 000 Old_age Always - 0 194 Temperature_Celsius 0x0002 064 061 000 Old_age Always - 32 (Min/Max 19/39) 195 Hardware_ECC_Recovered 0x003a 100 100 000 Old_age Always - 0 196 Reallocated_Event_Count 0x0032 252 252 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 252 252 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 252 252 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0036 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x002a 100 100 000 Old_age Always - 515 223 Load_Retry_Count 0x0032 100 100 000 Old_age Always - 13 225 Load_Cycle_Count 0x0032 098 098 000 Old_age Always - 29902 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. 
[To run self-tests, use: smartctl -t] Note: selective self-test log revision number (0) not 1 implies that no selective self-test has ever been run SMART Selective self-test log data structure revision number 0 Note: revision number not 1 implies that no selective self-test has ever been run SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Completed [00% left] (0-65535) 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
followed by ada2:
[root@kg-f3] ~# smartctl -a /dev/ada2 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: SAMSUNG HN-M101MBB Serial Number: S2R8J9HB911213 LU WWN Device Id: 5 0024e9 2061baa56 Firmware Version: 2AR10001 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat May 12 18:08:43 2012 CEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (13260) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 221) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. 
SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 100 100 051 Pre-fail Always - 0 2 Throughput_Performance 0x0026 252 252 000 Old_age Always - 0 3 Spin_Up_Time 0x0023 089 089 025 Pre-fail Always - 3456 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 28 5 Reallocated_Sector_Ct 0x0033 252 252 010 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 252 252 051 Old_age Always - 0 8 Seek_Time_Performance 0x0024 252 252 015 Old_age Offline - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 3140 10 Spin_Retry_Count 0x0032 252 252 051 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 12 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 28 181 Program_Fail_Cnt_Total 0x0022 100 100 000 Old_age Always - 4313666 191 G-Sense_Error_Rate 0x0022 252 252 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0022 252 252 000 Old_age Always - 0 194 Temperature_Celsius 0x0002 064 061 000 Old_age Always - 32 (Min/Max 19/39) 195 Hardware_ECC_Recovered 0x003a 100 100 000 Old_age Always - 0 196 Reallocated_Event_Count 0x0032 252 252 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 252 252 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 252 252 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0036 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x002a 100 100 000 Old_age Always - 1839 223 Load_Retry_Count 0x0032 100 100 000 Old_age Always - 12 225 Load_Cycle_Count 0x0032 098 098 000 Old_age Always - 29831 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. 
[To run self-tests, use: smartctl -t] Note: selective self-test log revision number (0) not 1 implies that no selective self-test has ever been run SMART Selective self-test log data structure revision number 0 Note: revision number not 1 implies that no selective self-test has ever been run SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Completed [00% left] (0-65535) 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
followed by ada4:
[root@kg-f3] ~# smartctl -a /dev/ada4 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: SAMSUNG HN-M101MBB Serial Number: S2R8J9HB911216 LU WWN Device Id: 5 0024e9 2061baa82 Firmware Version: 2AR10001 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat May 12 18:09:45 2012 CEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (12960) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 216) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. 
SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 100 100 051 Pre-fail Always - 1 2 Throughput_Performance 0x0026 252 252 000 Old_age Always - 0 3 Spin_Up_Time 0x0023 089 089 025 Pre-fail Always - 3468 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 28 5 Reallocated_Sector_Ct 0x0033 252 252 010 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 252 252 051 Old_age Always - 0 8 Seek_Time_Performance 0x0024 252 252 015 Old_age Offline - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 3140 10 Spin_Retry_Count 0x0032 252 252 051 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 9 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 28 181 Program_Fail_Cnt_Total 0x0022 100 100 000 Old_age Always - 4165060 191 G-Sense_Error_Rate 0x0022 252 252 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0022 252 252 000 Old_age Always - 0 194 Temperature_Celsius 0x0002 064 060 000 Old_age Always - 33 (Min/Max 19/40) 195 Hardware_ECC_Recovered 0x003a 100 100 000 Old_age Always - 0 196 Reallocated_Event_Count 0x0032 252 252 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 252 252 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 252 252 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0036 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x002a 100 100 000 Old_age Always - 7826 223 Load_Retry_Count 0x0032 100 100 000 Old_age Always - 9 225 Load_Cycle_Count 0x0032 098 098 000 Old_age Always - 29520 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. 
[To run self-tests, use: smartctl -t] Note: selective self-test log revision number (0) not 1 implies that no selective self-test has ever been run SMART Selective self-test log data structure revision number 0 Note: revision number not 1 implies that no selective self-test has ever been run SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Completed [00% left] (0-65535) 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
followed by ada5:
[root@kg-f3] ~# smartctl -a /dev/ada5 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF INFORMATION SECTION === Device Model: SAMSUNG HN-M101MBB Serial Number: S2R8J9HB911211 LU WWN Device Id: 5 0024e9 2061baa25 Firmware Version: 2AR10001 User Capacity: 1,000,204,886,016 bytes [1.00 TB] Sector Size: 512 bytes logical/physical Device is: Not in smartctl database [for details use: -P showall] ATA Version is: 8 ATA Standard is: ATA-8-ACS revision 6 Local Time is: Sat May 12 18:10:35 2012 CEST SMART support is: Available - device has SMART capability. SMART support is: Enabled === START OF READ SMART DATA SECTION === SMART overall-health self-assessment test result: PASSED General SMART Values: Offline data collection status: (0x00) Offline data collection activity was never started. Auto Offline Data Collection: Disabled. Self-test execution status: ( 0) The previous self-test routine completed without error or no self-test has ever been run. Total time to complete Offline data collection: (13320) seconds. Offline data collection capabilities: (0x5b) SMART execute Offline immediate. Auto Offline data collection on/off support. Suspend Offline collection upon new command. Offline surface scan supported. Self-test supported. No Conveyance Self-test supported. Selective Self-test supported. SMART capabilities: (0x0003) Saves SMART data before entering power-saving mode. Supports SMART auto save timer. Error logging capability: (0x01) Error logging supported. General Purpose Logging supported. Short self-test routine recommended polling time: ( 2) minutes. Extended self-test routine recommended polling time: ( 222) minutes. SCT capabilities: (0x003f) SCT Status supported. SCT Error Recovery Control supported. SCT Feature Control supported. SCT Data Table supported. 
SMART Attributes Data Structure revision number: 16 Vendor Specific SMART Attributes with Thresholds: ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE 1 Raw_Read_Error_Rate 0x002f 100 100 051 Pre-fail Always - 1 2 Throughput_Performance 0x0026 252 252 000 Old_age Always - 0 3 Spin_Up_Time 0x0023 089 089 025 Pre-fail Always - 3455 4 Start_Stop_Count 0x0032 100 100 000 Old_age Always - 28 5 Reallocated_Sector_Ct 0x0033 252 252 010 Pre-fail Always - 0 7 Seek_Error_Rate 0x002e 252 252 051 Old_age Always - 0 8 Seek_Time_Performance 0x0024 252 252 015 Old_age Offline - 0 9 Power_On_Hours 0x0032 100 100 000 Old_age Always - 3140 10 Spin_Retry_Count 0x0032 252 252 051 Old_age Always - 0 11 Calibration_Retry_Count 0x0032 100 100 000 Old_age Always - 40 12 Power_Cycle_Count 0x0032 100 100 000 Old_age Always - 28 181 Program_Fail_Cnt_Total 0x0022 100 100 000 Old_age Always - 4239754 191 G-Sense_Error_Rate 0x0022 252 252 000 Old_age Always - 0 192 Power-Off_Retract_Count 0x0022 252 252 000 Old_age Always - 0 194 Temperature_Celsius 0x0002 064 061 000 Old_age Always - 33 (Min/Max 19/39) 195 Hardware_ECC_Recovered 0x003a 100 100 000 Old_age Always - 0 196 Reallocated_Event_Count 0x0032 252 252 000 Old_age Always - 0 197 Current_Pending_Sector 0x0032 252 252 000 Old_age Always - 0 198 Offline_Uncorrectable 0x0030 252 252 000 Old_age Offline - 0 199 UDMA_CRC_Error_Count 0x0036 200 200 000 Old_age Always - 0 200 Multi_Zone_Error_Rate 0x002a 100 100 000 Old_age Always - 584 223 Load_Retry_Count 0x0032 100 100 000 Old_age Always - 40 225 Load_Cycle_Count 0x0032 098 098 000 Old_age Always - 29662 SMART Error Log Version: 1 No Errors Logged SMART Self-test log structure revision number 1 No self-tests have been logged. 
[To run self-tests, use: smartctl -t] Note: selective self-test log revision number (0) not 1 implies that no selective self-test has ever been run SMART Selective self-test log data structure revision number 0 Note: revision number not 1 implies that no selective self-test has ever been run SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS 1 0 0 Completed [00% left] (0-65535) 2 0 0 Not_testing 3 0 0 Not_testing 4 0 0 Not_testing 5 0 0 Not_testing Selective self-test flags (0x0): After scanning selected spans, do NOT read-scan remainder of disk. If Selective self-test is pending on power-up, resume after 0 minute delay.
Running tests on ada3, first check the logs:
[root@kg-f3] ~# smartctl -l selftest /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART Self-test log structure revision number 1 No self-tests have been logged. [To run self-tests, use: smartctl -t] [root@kg-f3] ~# smartctl -l error /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART Error Log Version: 1 No Errors Logged
and no errors. Good. Now, the tests:
[root@kg-f3] ~# smartctl -t offline /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION === Sending command: "Execute SMART off-line routine immediately in off-line mode". Drive command "Execute SMART off-line routine immediately in off-line mode" successful. Testing has begun. Please wait 13020 seconds for test to complete. Test will complete after Sat May 12 21:53:24 2012 Use smartctl -X to abort test.
Oops - this one takes a long time. Let's try the short test instead:
[root@kg-f3] ~# smartctl -t short /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION === Sending command: "Execute SMART Short self-test routine immediately in off-line mode". Drive command "Execute SMART Short self-test routine immediately in off-line mode" successful. Testing has begun. Please wait 2 minutes for test to complete. Test will complete after Sat May 12 18:20:13 2012 Use smartctl -X to abort test.
After it has finished, check the logs again:
[root@kg-f3] ~# smartctl -l selftest /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Short offline Completed without error 00% 3138 - [root@kg-f3] ~# smartctl -l error /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART Error Log Version: 1 No Errors Logged
Looks good. Now for the long test:
[root@kg-f3] ~# smartctl -t long /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF OFFLINE IMMEDIATE AND SELF-TEST SECTION === Sending command: "Execute SMART Extended self-test routine immediately in off-line mode". Drive command "Execute SMART Extended self-test routine immediately in off-line mode" successful. Testing has begun. Please wait 217 minutes for test to complete. Test will complete after Sat May 12 22:02:57 2012 Use smartctl -X to abort test.
After it has finished, check the logs again:
[root@kg-f3] ~# smartctl -l selftest /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART Self-test log structure revision number 1 Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error # 1 Extended offline Completed: read failure 10% 3142 765896344 # 2 Short offline Completed without error 00% 3138 - [root@kg-f3] ~# smartctl -l error /dev/ada3 smartctl 5.41 2011-06-09 r3365 [FreeBSD 8.2-RELEASE-p7 amd64] (local build) Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net === START OF READ SMART DATA SECTION === SMART Error Log Version: 1 No Errors Logged
OK, the extended self-test failed with a read failure at LBA 765896344. Time to buy a new drive. That's all.
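Checking the self-test log by eye works for one drive, but with six drives it gets tedious. Below is a minimal sketch, assuming the row layout seen in the 'smartctl -l selftest' output above (row number, description and status, remaining percent, lifetime hours, LBA or "-"). The function name failed_selftests and the regular expression are my own, not part of smartmontools. The 512-byte sector size is taken from the "Sector Size" line smartctl prints for these drives.

```python
import re

# One row of the self-test table:
# "# <num> <description and status> <nn>% <lifetime hours> <LBA or ->"
ROW = re.compile(r"#\s*(\d+)\s+(.+?)\s+(\d+)%\s+(\d+)\s+(\d+|-)")


def failed_selftests(log_text, sector_size=512):
    """Return one dict per self-test row whose status mentions a failure."""
    failures = []
    for num, text, _remaining, hours, lba in ROW.findall(log_text):
        if "failure" not in text.lower():
            continue  # e.g. "Completed without error"
        first_bad = None if lba == "-" else int(lba)
        failures.append({
            "num": int(num),
            "test": text,
            "lifetime_hours": int(hours),
            "lba": first_bad,
            # Byte offset of the first unreadable sector, if one was reported.
            "byte_offset": None if first_bad is None else first_bad * sector_size,
        })
    return failures
```

Fed the log text from this entry, it should report only row 1 (the extended test) with LBA 765896344; the short test row is skipped because its status does not mention a failure.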
2012-05-12: I'm getting errors in /var/log/messages when copying data to the machine:
May 12 16:43:12 kg-f3 kernel: ahcich3: Timeout on slot 31 port 0 May 12 16:43:12 kg-f3 kernel: ahcich3: is 00000000 cs 000001fe ss 800001ff rs 800001ff tfd 40 serr 00000000 May 12 16:43:43 kg-f3 kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) May 12 16:44:13 kg-f3 kernel: ahcich3: Timeout on slot 8 port 0 May 12 16:44:13 kg-f3 kernel: ahcich3: is 00000000 cs 00000100 ss 00000000 rs 00000100 tfd 80 serr 00000000 May 12 16:44:45 kg-f3 kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) May 12 16:45:15 kg-f3 kernel: ahcich3: Timeout on slot 8 port 0 May 12 16:45:15 kg-f3 kernel: ahcich3: is 00000000 cs 00000100 ss 00000000 rs 00000100 tfd 80 serr 00000000 May 12 16:45:15 kg-f3 kernel: (ada3:ahcich3:0:0:0): lost device May 12 16:45:46 kg-f3 kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) May 12 16:46:16 kg-f3 kernel: ahcich3: Timeout on slot 8 port 0 May 12 16:46:16 kg-f3 kernel: ahcich3: is 00000000 cs 0003ff00 ss 0003ff00 rs 0003ff00 tfd 80 serr 00000000 May 12 16:46:47 kg-f3 kernel: xptioctl: pass driver is not in the kernel May 12 16:46:47 kg-f3 kernel: xptioctl: put "device pass" in your kernel config file May 12 16:47:05 kg-f3 kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) May 12 16:47:05 kg-f3 kernel: ahcich3: Poll timeout on slot 17 port 0 May 12 16:47:05 kg-f3 kernel: ahcich3: is 00000000 cs 00020000 ss 00000000 rs 00020000 tfd 80 serr 00000000 May 12 16:47:35 kg-f3 kernel: ahcich3: Timeout on slot 17 port 0 May 12 16:47:35 kg-f3 kernel: ahcich3: is 00000000 cs 07fe0000 ss 07fe0000 rs 07fe0000 tfd 80 serr 00000000 May 12 16:48:25 kg-f3 kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) May 12 16:48:25 kg-f3 kernel: ahcich3: Poll timeout on slot 26 port 0 May 12 16:48:25 kg-f3 kernel: ahcich3: is 00000000 cs 04000000 ss 00000000 rs 04000000 tfd 80 serr 00000000 May 12 16:48:55 kg-f3 kernel: ahcich3: Timeout on slot 26 port 0 
May 12 16:48:55 kg-f3 kernel: ahcich3: is 00000000 cs fc00000f ss fc00000f rs fc00000f tfd 80 serr 00000000
Not good. Is the hard drive at ada3 dying? For reference, here is the relevant dmesg output: controllers
May 11 23:46:23 kg-f3 kernel: ahci0: <ATI IXP700 AHCI SATA controller> port 0xf190-0xf197,0xf180-0xf183,0xf170-0xf177,0xf160-0xf163,0xf150-0xf15f mem 0xfeb4b000-0xfeb4b3ff irq 19 at device 17.0 on pci0 May 11 23:46:23 kg-f3 kernel: ahci0: [ITHREAD] May 11 23:46:23 kg-f3 kernel: ahci0: AHCI v1.20 with 4 6Gbps ports, Port Multiplier supported May 11 23:46:23 kg-f3 kernel: ahcich0: <AHCI channel> at channel 0 on ahci0 May 11 23:46:23 kg-f3 kernel: ahcich0: [ITHREAD] May 11 23:46:23 kg-f3 kernel: ahcich1: <AHCI channel> at channel 1 on ahci0 May 11 23:46:23 kg-f3 kernel: ahcich1: [ITHREAD] May 11 23:46:23 kg-f3 kernel: ahcich2: <AHCI channel> at channel 2 on ahci0 May 11 23:46:23 kg-f3 kernel: ahcich2: [ITHREAD] May 11 23:46:23 kg-f3 kernel: ahcich3: <AHCI channel> at channel 3 on ahci0 May 11 23:46:23 kg-f3 kernel: ahcich3: [ITHREAD]
...
May 11 23:46:23 kg-f3 kernel: atapci0: <ATI IXP700/800 UDMA133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xf100-0xf10f irq 17 at device 20.1 on pci0 May 11 23:46:23 kg-f3 kernel: ata0: <ATA channel 0> on atapci0 May 11 23:46:23 kg-f3 kernel: ata0: [ITHREAD] May 11 23:46:23 kg-f3 kernel: ata1: <ATA channel 1> on atapci0 May 11 23:46:23 kg-f3 kernel: ata1: [ITHREAD]
... usb memory stick (boot device)
May 11 23:46:23 kg-f3 kernel: ugen1.2: <Verbatim> at usbus1 May 11 23:46:23 kg-f3 kernel: umass0: <Verbatim STORE N GO, class 0/0, rev 2.00/1.00, addr 2> on usbus1 May 11 23:46:23 kg-f3 kernel: da0: <Verbatim STORE N GO 3.00> Removable Direct Access SCSI-2 device May 11 23:46:23 kg-f3 kernel: da0: 40.000MB/s transfers May 11 23:46:23 kg-f3 kernel: da0: 3823MB (7829504 512 byte sectors: 255H 63S/T 487C)
... hard drives (storage)
May 11 23:46:23 kg-f3 kernel: ada0 at ahcich0 bus 0 scbus0 target 0 lun 0 May 11 23:46:23 kg-f3 kernel: ada0: <SAMSUNG HN-M101MBB 2AR10001> ATA-8 SATA 2.x device May 11 23:46:23 kg-f3 kernel: ada0: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes) May 11 23:46:23 kg-f3 kernel: ada0: Command Queueing enabledda0 at umass-sim0 bus 0 scbus6 target 0 lun 0 May 11 23:46:23 kg-f3 kernel: ada0: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C) May 11 23:46:23 kg-f3 kernel: ada1 at ahcich1 bus 0 scbus1 target 0 lun 0 May 11 23:46:23 kg-f3 kernel: ada1: <SAMSUNG HN-M101MBB 2AR10001> ATA-8 SATA 2.x device May 11 23:46:23 kg-f3 kernel: ada1: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes) May 11 23:46:23 kg-f3 kernel: ada1: Command Queueing enabled May 11 23:46:23 kg-f3 kernel: ada1: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C) May 11 23:46:23 kg-f3 kernel: ada2 at ahcich2 bus 0 scbus2 target 0 lun 0 May 11 23:46:23 kg-f3 kernel: ada2: <SAMSUNG HN-M101MBB 2AR10001> ATA-8 SATA 2.x device May 11 23:46:23 kg-f3 kernel: ada2: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes) May 11 23:46:23 kg-f3 kernel: ada2: Command Queueing enabled(da0:umass-sim0:0:0:0): Synchronize cache failed, status == 0x10, scsi status == 0x0 May 11 23:46:23 kg-f3 kernel: May 11 23:46:23 kg-f3 kernel: ada2: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C) May 11 23:46:23 kg-f3 kernel: ada3 at ahcich3 bus 0 scbus3 target 0 lun 0 May 11 23:46:23 kg-f3 kernel: ada3: <SAMSUNG HN-M101MBB 2AR10001> ATA-8 SATA 2.x device May 11 23:46:23 kg-f3 kernel: ada3: 300.000MB/s transfers (SATA 2.x, UDMA6, PIO 8192bytes) May 11 23:46:23 kg-f3 kernel: ada3: Command Queueing enabled May 11 23:46:23 kg-f3 kernel: ada3: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C) May 11 23:46:23 kg-f3 kernel: ada4 at ata0 bus 0 scbus4 target 0 lun 0 May 11 23:46:23 kg-f3 kernel: ada4: <SAMSUNG HN-M101MBB 2AR10001> ATA-8 SATA 2.x device May 11 23:46:23 kg-f3 kernel: ada4: 150.000MB/s 
transfers (SATA, UDMA6, PIO 8192bytes) May 11 23:46:23 kg-f3 kernel: ada4: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C) May 11 23:46:23 kg-f3 kernel: ada5 at ata0 bus 0 scbus4 target 1 lun 0 May 11 23:46:23 kg-f3 kernel: ada5: <SAMSUNG HN-M101MBB 2AR10001> ATA-8 SATA 2.x device May 11 23:46:23 kg-f3 kernel: ada5: 150.000MB/s transfers (SATA, UDMA6, PIO 8192bytes) May 11 23:46:23 kg-f3 kernel: ada5: 953869MB (1953525168 512 byte sectors: 16H 63S/T 16383C)
That's all.
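To see at a glance which channel is misbehaving and which disk sits on it, the two logs in this entry can be cross-referenced. This is only a sketch: the function names are hypothetical, and the patterns assume the exact message wording shown above ("Timeout on slot", "Poll timeout on slot", "AHCI reset: device not ready", and "adaN at <channel> bus").

```python
import re
from collections import Counter

# Kernel messages of interest, as they appear in /var/log/messages above.
EVENT = re.compile(
    r"kernel: (ahcich\d+): "
    r"(Timeout on slot|Poll timeout on slot|AHCI reset: device not ready)"
)
# Attach lines from dmesg, e.g. "ada3 at ahcich3 bus 0 scbus3 target 0 lun 0".
ATTACH = re.compile(r"(ada\d+) at (\w+) bus")


def count_ahci_events(log_text):
    """Count timeouts and resets per AHCI channel."""
    counts = Counter()
    for channel, what in EVENT.findall(log_text):
        kind = "reset" if what.startswith("AHCI reset") else "timeout"
        counts[(channel, kind)] += 1
    return counts


def disks_by_channel(dmesg_text):
    """Map each channel name to the list of adaN disks attached to it."""
    mapping = {}
    for disk, channel in ATTACH.findall(dmesg_text):
        mapping.setdefault(channel, []).append(disk)
    return mapping
```

In the logs above every timeout and reset is on ahcich3, and the dmesg output attaches ada3 to ahcich3, which is why ada3 is the suspect.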
2012-05-11: Upgraded to FreeNAS 8.0.4-RELEASE-p2, using the GUI upgrade method. After the upgrade, the ip address changed again. Fixed by ssh'ing into the machine, and doing:
[root@kg-f3] ~# mount -w / [root@kg-f3] ~# vi /conf/base/etc/dhclient.conf
and adding the line
send dhcp-client-identifier "kg-f3.kg4.no";
to the file before saving it.
[root@kg-f3] ~# mount -r /
then I rebooted the machine via the gui. It came up with the correct ip address this time. From the web gui:
FreeNAS build: FreeNAS-8.0.4-RELEASE-p2-x64 (11367) OS version: FreeBSD 8.2-RELEASE-p7
From the command line:
tingo@kg-f3$ id uid=1001(tingo) gid=1001(users) groups=1001(users),0(wheel) tingo@kg-f3$ uname -a FreeBSD kg-f3.kg4.no 8.2-RELEASE-p7 FreeBSD 8.2-RELEASE-p7 #0: Thu May 3 12:22:17 PDT 2012 jpaetzel@build.ixsystems.com:/home/jpaetzel/8.0.4-RELEASE-p2/obj.amd64/build/home/jpaetzel/8.0.4-RELEASE-p2/FreeBSD/src/sys/FREENAS.amd64 amd64 tingo@kg-f3$ df -h Filesystem Size Used Avail Capacity Mounted on /dev/ufs/FreeNASs1a 927M 379M 474M 44% / devfs 1.0K 1.0K 0B 100% /dev /dev/md0 4.6M 1.9M 2.3M 44% /etc /dev/md1 824K 2.0K 756K 0% /mnt /dev/md2 149M 7.0M 130M 5% /var /dev/ufs/FreeNASs4 20M 603K 18M 3% /data zstore 2.7T 42K 2.7T 0% /mnt/zstore zstore/home-tingo 2.9T 178G 2.7T 6% /mnt/zstore/home-tingo zstore/media 2.8T 70G 2.7T 2% /mnt/zstore/media zstore/xxx 4.2T 1.5T 2.7T 35% /mnt/zstore/xxx
rootfs (/) is back to slice 1 again.
2012-04-25: I upgraded to FreeNAS 8.0.4-RELEASE-p1, using the GUI upgrade method. It worked, but after the upgrade, the machine got a different ip address. I had to peek at my DHCP server to figure it out. From the web gui:
FreeNAS build: FreeNAS-8.0.4-RELEASE-p1-x64 (11059) OS Version: FreeBSD 8.2-RELEASE-p6
I used the web gui to enable some services (SMART and ssh), via Services -> Control services. Then I could connect to the machine via ssh. From the command line:
tingo@kg-f3$ id uid=1001(tingo) gid=1001(users) groups=1001(users),0(wheel) tingo@kg-f3$ uname -a FreeBSD kg-f3.kg4.no 8.2-RELEASE-p6 FreeBSD 8.2-RELEASE-p6 #0: Wed Apr 11 10:19:51 PDT 2012 jpaetzel@build.ixsystems.com:/home/jpaetzel/fn_build/8.0.4/obj.amd64/build/home/jpaetzel/fn_build/8.0.4/FreeBSD/src/sys/FREENAS.amd64 amd64 tingo@kg-f3$ df -h Filesystem Size Used Avail Capacity Mounted on /dev/ufs/FreeNASs2a 927M 379M 474M 44% / devfs 1.0K 1.0K 0B 100% /dev /dev/md0 4.6M 1.9M 2.3M 44% /etc /dev/md1 824K 2.0K 756K 0% /mnt /dev/md2 149M 8.8M 128M 6% /var /dev/ufs/FreeNASs4 20M 999K 17M 5% /data zstore 2.8T 42K 2.8T 0% /mnt/zstore zstore/home-tingo 3.0T 176G 2.8T 6% /mnt/zstore/home-tingo zstore/media 2.8T 62G 2.8T 2% /mnt/zstore/media zstore/xxx 4.2T 1.4T 2.8T 34% /mnt/zstore/xxx
I see that rootfs (/) has changed to slice 2 of the usb memory stick now. OK, fix the ip address again, by using my local fix:
[root@kg-f3] ~# mount -w / [root@kg-f3] ~# vi /conf/base/etc/dhclient.conf
add the line
send dhcp-client-identifier "kg-f3.kg4.no";
and save.
[root@kg-f3] ~# mount -r /
then I rebooted the FreeNAS box (via GUI). Does it come up again with the correct ip address? Yes it did. Nice!
2012-01-06: storage info. From shell on my FreeNAS box:
$ uname -a FreeBSD kg-f3.kg4.no 8.2-RELEASE-p5 FreeBSD 8.2-RELEASE-p5 #0: Tue Jan 3 01:23:58 PST 2012 root@build.ixsystems.com:/build/home/gcooper/e2e-bld.RqwJtDXy/obj.amd64/build/home/gcooper/e2e-bld.RqwJtDXy/FreeBSD/src/sys/FREENAS.amd64 amd64 $ zpool list NAME SIZE USED AVAIL CAP HEALTH ALTROOT zstore 5.44T 124G 5.32T 2% ONLINE /mnt $ df -h Filesystem Size Used Avail Capacity Mounted on /dev/ufs/FreeNASs1a 927M 375M 477M 44% / devfs 1.0K 1.0K 0B 100% /dev /dev/md0 4.6M 1.8M 2.3M 44% /etc /dev/md1 824K 2.0K 756K 0% /mnt /dev/md2 149M 8.1M 129M 6% /var /dev/ufs/FreeNASs4 20M 591K 18M 3% /data zstore 4.3T 37K 4.3T 0% /mnt/zstore zstore/home-tingo 4.4T 103G 4.3T 2% /mnt/zstore/home-tingo
So, about 4.4T reported. Looks sane to me. Here is what it looks like when mounted via sshfs on my Linux laptop:
tingo@kg-u35jc:~$ uname -a Linux kg-u35jc 2.6.35-31-generic #63-Ubuntu SMP Mon Nov 28 19:29:10 UTC 2011 x86_64 GNU/Linux tingo@kg-u35jc:~$ lsb_release -a No LSB modules are available. Distributor ID: Ubuntu Description: Ubuntu 10.10 Release: 10.10 Codename: maverick tingo@kg-u35jc:~$ df -h Filesystem Size Used Avail Use% Mounted on /dev/sda5 202G 72G 120G 38% / none 1.9G 296K 1.9G 1% /dev none 1.9G 192K 1.9G 1% /dev/shm none 1.9G 184K 1.9G 1% /var/run none 1.9G 0 1.9G 0% /var/lock none 202G 72G 120G 38% /var/lib/ureadahead/debugfs tingo@f3:/mnt/zstore/home-tingo 1.2P 26T 1.1P 3% /home/tingo/mpoint
Wow, P? As in Petabytes? Funny. :)
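As a sanity check on the FreeNAS-side numbers in this entry (5.44T from zpool list, roughly 4.4T from df), here is a small worked calculation. It is a sketch under assumptions: six disks in one raidz1 vdev (as in the zpool status at the top of this page), the per-disk capacity from smartctl's "User Capacity" line, and whole disks rather than the slightly smaller p2 partitions the pool actually uses. ZFS metadata overhead is not modelled.

```python
DISK_BYTES = 1_000_204_886_016  # "User Capacity" reported by smartctl
DISKS = 6                       # assumed: all six drives in one raidz1 vdev
PARITY = 1                      # raidz1 keeps one disk's worth of parity
TIB = 2 ** 40


def raidz_capacity_tib(disk_bytes, disks, parity):
    """Return (raw, usable) capacity in TiB, before any ZFS overhead."""
    raw = disk_bytes * disks
    usable = disk_bytes * (disks - parity)
    return raw / TIB, usable / TIB


raw_tib, usable_tib = raidz_capacity_tib(DISK_BYTES, DISKS, PARITY)
print(f"raw {raw_tib:.2f} TiB, usable before overhead {usable_tib:.2f} TiB")
```

zpool list reports the raw size including parity, while df reports space after parity, so the two figures differ by about one disk. The reported values come out a little lower than this estimate, which is expected given the partitioning and filesystem overhead left out here.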
2012-01-04: FreeNAS 8.0.3 - install to the usb stick, from my workstation:
root@kg-v2# xzcat FreeNAS-8.0.3-RELEASE-x64.Full_Install.xz | dd of=/dev/da4 bs=64k 0+244141 records in 0+244141 records out 2000000000 bytes transferred in 740.731574 secs (2700033 bytes/sec)
Next, insert into FreeNAS box and boot: yes, it worked. Of course, the resulting FreeNAS box is "blank", so I need to upload the config. Go to the web gui, System, Settings, General, "Upload Config". I selected the previous config file from my workstation, and pressed the "apply" button. After that, the FreeNAS box applied the config, rebooted, updated the schema, and rebooted again. It came up again, with the zfs volume (there was only one) already mounted. Nice! Info (from web gui):
FreeNAS build: FreeNAS-8.0.3-RELEASE-x64 (9395) FreeBSD version: FreeBSD 8.2-RELEASE-p5
From command line:
$ id uid=1001(tingo) gid=1001(users) groups=1001(users) $ uname -a FreeBSD kg-f3.kg4.no 8.2-RELEASE-p5 FreeBSD 8.2-RELEASE-p5 #0: Tue Jan 3 01:23:58 PST 2012 root@build.ixsystems.com:/build/home/gcooper/e2e-bld.RqwJtDXy/obj.amd64/build/home/gcooper/e2e-bld.RqwJtDXy/FreeBSD/src/sys/FREENAS.amd64 amd64
Of course, I still need to apply my dhcp fix. From the local console:
%mount -w / %vi /conf/base/etc/dhclient.conf
add the line
send dhcp-client-identifier "kg-f3.kg4.no";
and save.
%mount -r /
then I rebooted the FreeNAS box. It came up again, this time with the correct ip address.
2012-01-04: backup - backed up the config (via the gui). I also backed up /etc/dhclient.conf (which is a copy of /conf/base/etc/dhclient.conf, see earlier entry). Next, shutdown. Then I put the usb stick in my workstation, and backed up the / (root) partition like this:
tingo@kg-v2$ dd if=/dev/da4s1 of=./8.0.2_backup_s1.image bs=16k 60320+1 records in 60320+1 records out 988291584 bytes transferred in 182.645689 secs (5410977 bytes/sec)
make a backup of the mnt and data partitions too, just to be on the safe side:
tingo@kg-v2$ dd if=/dev/da4s3 of=./8.0.2_backup_s3_mnt.image bs=16k 94+1 records in 94+1 records out 1548288 bytes transferred in 0.322678 secs (4798243 bytes/sec) tingo@kg-v2$ dd if=/dev/da4s4 of=./8.0.2_backup_s4_data.image bs=16k 1291+1 records in 1291+1 records out 21159936 bytes transferred in 3.917126 secs (5401903 bytes/sec)
That's it.
2011-11-04: simplistic disk speed test:
[root@kg-f3] ~# dd if=/dev/zero of=/mnt/zstore/home-tingo/testfile.test bs=8192k count=1000 1000+0 records in 1000+0 records out 8388608000 bytes transferred in 30.255672 secs (277257366 bytes/sec)
So about 277 Mbytes per second. A slightly different test:
[root@kg-f3] ~# dd if=/dev/zero of=/mnt/zstore/home-tingo/testfile2.test bs=1024k count=20000 20000+0 records in 20000+0 records out 20971520000 bytes transferred in 85.479457 secs (245339883 bytes/sec)
That's it.
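The figure dd prints is bytes per second, so "277 Mbytes" above is in decimal megabytes. A small sketch of the conversion follows; the function name is my own. Note that the test writes zeros, so the numbers could overstate real-world speed if compression were enabled on the dataset (the log does not say).

```python
def throughput(total_bytes, seconds):
    """Return (MB/s, MiB/s) for a dd run, from its bytes and elapsed time."""
    bytes_per_sec = total_bytes / seconds
    return bytes_per_sec / 1e6, bytes_per_sec / 2 ** 20


# The two runs from this entry: (label, bytes transferred, seconds).
runs = [
    ("bs=8192k count=1000", 8388608000, 30.255672),
    ("bs=1024k count=20000", 20971520000, 85.479457),
]
for label, size, secs in runs:
    mb, mib = throughput(size, secs)
    print(f"{label}: {mb:.0f} MB/s ({mib:.0f} MiB/s)")
```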
2011-11-03: temperature monitoring: copying amdtemp.ko from another machine didn't work - kernel / version mismatch. I must boot FreeBSD on this machine to verify if amdtemp.ko works. And acpi_aiboost.ko too.
2011-11-03: fan speed - according to the bios, cpu temp is at 49 degrees C, and chassis fan speed is about 672 rpm with the "Standard" setting. Setting CPU fan speed low limit to "ignore" - this board doesn't have a cpu fan. Chassis fan - changing profile to "Silent" and speed low limit to 200 rpm. The fan is still audible. Hrm. SMBIOS info:
$ kenv | grep smbios smbios.bios.reldate="08/08/2011" smbios.bios.vendor="American Megatrends Inc." smbios.bios.version="0802" smbios.chassis.maker="Chassis Manufacture" smbios.chassis.serial="Chassis Serial Number" smbios.chassis.tag="Asset-1234567890" smbios.chassis.version="Chassis Version" smbios.memory.enabled="8388608" smbios.planar.maker="ASUSTeK Computer INC." smbios.planar.product="E35M1-I" smbios.planar.serial="MT7015054700938" smbios.planar.version="Rev X.0x" smbios.socket.enabled="1" smbios.socket.populated="1" smbios.system.maker="System manufacturer" smbios.system.product="System Product Name" smbios.system.serial="System Serial Number" smbios.system.uuid="78addb00-b83f-11dc-afde-f46d0473b954" smbios.system.version="System Version" smbios.version="2.6"
Not much there. dmidecode output.
2011-11-03: It doesn't look like FreeNAS has the acpi_aiboost kernel module built in. Here are the contents of /boot/kernel:
[root@kg-f3] ~# ls /boot/kernel
./               geom_gate.ko.gz*    geom_part_ebr.ko*    geom_shsec.ko*      libiconv.ko*
../              geom_journal.ko*    geom_part_gpt.ko*    geom_stripe.ko.gz*  libmchain.ko*
ext2fs.ko*       geom_label.ko*      geom_part_mbr.ko*    geom_sunlabel.ko*   linker.hints
geom_bde.ko*     geom_linux_lvm.ko*  geom_part_pc98.ko*   geom_uzip.ko*       ntfs.ko.gz*
geom_bsd.ko*     geom_mbr.ko*        geom_part_vtoc8.ko*  geom_vinum.ko*      opensolaris.ko*
geom_cache.ko*   geom_mirror.ko.gz*  geom_pc98.ko*        geom_virstor.ko*    smbfs.ko.gz*
geom_ccd.ko*     geom_multipath.ko*  geom_raid.ko*        geom_vol_ffs.ko*    udf.ko*
geom_concat.ko*  geom_nop.ko*        geom_raid3.ko.gz*    geom_zero.ko*       xhci.ko*
geom_eli.ko*     geom_part_apm.ko*   geom_raid5.ko*       gsched_rr.ko*       zfs.ko*
geom_fox.ko*     geom_part_bsd.ko*   geom_sched.ko*       kernel.gz*
Interesting.
2011-10-31: network config: I couldn't figure out how to do this from within the FreeNAS gui, so I did it manually. In my network, all machines get ip addresses via dhcp. But servers need ip addresses that don't change (and that simplifies dns setup too). Solution: my machines send a dhcp client identifier when they request an ip address. My dhcp server is set up to deal with that. I mounted root rw:
mount -w /
and edited
/conf/base/etc/dhclient.conf to include the line
send dhcp-client-identifier "kg-f3.kg4.no";
then I rebooted the machine. Afterwards, all is well.
2011-10-31: dmesg output: normal.
2011-10-31: pciconf output:
[root@freenas] ~# pciconf -lvc
hostb0@pci0:0:0:0: class=0x060000 card=0x84a51043 chip=0x15101022 rev=0x00 hdr=0x00
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = HOST-PCI
vgapci0@pci0:0:1:0: class=0x030000 card=0x84a51043 chip=0x98021002 rev=0x00 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    class = display
    subclass = VGA
    cap 01[50] = powerspec 3 supports D0 D1 D2 D3 current D0
    cap 10[58] = PCI-Express 2 root endpoint max data 128(128) link x0(x0)
    cap 05[a0] = MSI supports 1 message, 64 bit
    ecap 000b[100] = unknown 1
pcib1@pci0:0:4:0: class=0x060400 card=0x12341022 chip=0x15121022 rev=0x00 hdr=0x01
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = PCI-PCI
    cap 01[50] = powerspec 3 supports D0 D3 current D0
    cap 10[58] = PCI-Express 2 root port max data 128(128) link x16(x4)
    cap 05[a0] = MSI supports 1 message, 64 bit
    cap 0d[b0] = PCI Bridge card=0x12341022
    cap 08[b8] = HT MSI fixed address window enabled at 0xfee00000
    ecap 000b[100] = unknown 1
ahci0@pci0:0:17:0: class=0x01018f card=0x84961043 chip=0x43901002 rev=0x40 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'Integrated SATA II Controller (SB700)'
    class = mass storage
    subclass = ATA
    cap 12[70] = SATA Index-Data Pair
    cap 13[a4] = PCI Advanced Features: FLR TP
ohci0@pci0:0:18:0: class=0x0c0310 card=0x84961043 chip=0x43971002 rev=0x00 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'SB700 USB OHCI0 Controller'
    class = serial bus
    subclass = USB
ehci0@pci0:0:18:2: class=0x0c0320 card=0x84961043 chip=0x43961002 rev=0x00 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'SB700 USB EHCI Controller'
    class = serial bus
    subclass = USB
    cap 01[c0] = powerspec 2 supports D0 D1 D2 D3 current D0
    cap 0a[e4] = EHCI Debug Port at offset 0xe0 in map 0x14
ohci1@pci0:0:19:0: class=0x0c0310 card=0x84961043 chip=0x43971002 rev=0x00 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'SB700 USB OHCI0 Controller'
    class = serial bus
    subclass = USB
ehci1@pci0:0:19:2: class=0x0c0320 card=0x84961043 chip=0x43961002 rev=0x00 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'SB700 USB EHCI Controller'
    class = serial bus
    subclass = USB
    cap 01[c0] = powerspec 2 supports D0 D1 D2 D3 current D0
    cap 0a[e4] = EHCI Debug Port at offset 0xe0 in map 0x14
none0@pci0:0:20:0: class=0x0c0500 card=0x84961043 chip=0x43851002 rev=0x42 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'ATI SMBus (ATI RD600/RS600)'
    class = serial bus
    subclass = SMBus
atapci0@pci0:0:20:1: class=0x01018a card=0x84961043 chip=0x439c1002 rev=0x40 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'PATA 133 Controller (SB7xx)'
    class = mass storage
    subclass = ATA
none1@pci0:0:20:2: class=0x040300 card=0x84151043 chip=0x43831002 rev=0x40 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'IXP SB600 High Definition Audio Controller'
    class = multimedia
    subclass = HDA
    cap 01[50] = powerspec 2 supports D0 D3 current D0
isab0@pci0:0:20:3: class=0x060100 card=0x84961043 chip=0x439d1002 rev=0x40 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'SB700 LPC host controller'
    class = bridge
    subclass = PCI-ISA
pcib2@pci0:0:20:4: class=0x060401 card=0x00000000 chip=0x43841002 rev=0x40 hdr=0x01
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'IXP SB600 PCI to PCI Bridge'
    class = bridge
    subclass = PCI-PCI
ohci2@pci0:0:20:5: class=0x0c0310 card=0x84961043 chip=0x43991002 rev=0x00 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'SB700 USB OHCI2 Controller'
    class = serial bus
    subclass = USB
pcib3@pci0:0:21:0: class=0x060400 card=0x00001002 chip=0x43a01002 rev=0x00 hdr=0x01
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    class = bridge
    subclass = PCI-PCI
    cap 01[50] = powerspec 3 supports D0 D1 D2 D3 current D0
    cap 10[58] = PCI-Express 2 root port max data 128(128) link x16(x1)
    cap 05[a0] = MSI supports 1 message, 64 bit
    cap 0d[b0] = PCI Bridge card=0x00001002
    cap 08[b8] = HT MSI fixed address window enabled at 0xfee00000
    ecap 000b[100] = unknown 1
pcib4@pci0:0:21:1: class=0x060400 card=0x00001002 chip=0x43a11002 rev=0x00 hdr=0x01
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    class = bridge
    subclass = PCI-PCI
    cap 01[50] = powerspec 3 supports D0 D1 D2 D3 current D0
    cap 10[58] = PCI-Express 2 root port max data 128(128) link x1(x1)
    cap 05[a0] = MSI supports 1 message, 64 bit
    cap 0d[b0] = PCI Bridge card=0x00001002
    cap 08[b8] = HT MSI fixed address window enabled at 0xfee00000
    ecap 000b[100] = unknown 1
ohci3@pci0:0:22:0: class=0x0c0310 card=0x84961043 chip=0x43971002 rev=0x00 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'SB700 USB OHCI0 Controller'
    class = serial bus
    subclass = USB
ehci2@pci0:0:22:2: class=0x0c0320 card=0x84961043 chip=0x43961002 rev=0x00 hdr=0x00
    vendor = 'ATI Technologies Inc. / Advanced Micro Devices, Inc.'
    device = 'SB700 USB EHCI Controller'
    class = serial bus
    subclass = USB
    cap 01[c0] = powerspec 2 supports D0 D1 D2 D3 current D0
    cap 0a[e4] = EHCI Debug Port at offset 0xe0 in map 0x14
hostb1@pci0:0:24:0: class=0x060000 card=0x00000000 chip=0x17001022 rev=0x43 hdr=0x00
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = HOST-PCI
hostb2@pci0:0:24:1: class=0x060000 card=0x00000000 chip=0x17011022 rev=0x00 hdr=0x00
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = HOST-PCI
hostb3@pci0:0:24:2: class=0x060000 card=0x00000000 chip=0x17021022 rev=0x00 hdr=0x00
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = HOST-PCI
hostb4@pci0:0:24:3: class=0x060000 card=0x00000000 chip=0x17031022 rev=0x00 hdr=0x00
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = HOST-PCI
    cap 0f[f0] = unknown
hostb5@pci0:0:24:4: class=0x060000 card=0x00000000 chip=0x17041022 rev=0x00 hdr=0x00
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = HOST-PCI
hostb6@pci0:0:24:5: class=0x060000 card=0x00000000 chip=0x17181022 rev=0x00 hdr=0x00
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = HOST-PCI
hostb7@pci0:0:24:6: class=0x060000 card=0x00000000 chip=0x17161022 rev=0x00 hdr=0x00
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = HOST-PCI
hostb8@pci0:0:24:7: class=0x060000 card=0x00000000 chip=0x17191022 rev=0x00 hdr=0x00
    vendor = 'Advanced Micro Devices (AMD)'
    class = bridge
    subclass = HOST-PCI
re0@pci0:4:0:0: class=0x020000 card=0x84321043 chip=0x816810ec rev=0x06 hdr=0x00
    vendor = 'Realtek Semiconductor'
    device = 'Gigabit Ethernet NIC(NDIS 6.0) (RTL8168/8111/8111c)'
    class = network
    subclass = ethernet
    cap 01[40] = powerspec 3 supports D0 D1 D2 D3 current D0
    cap 05[50] = MSI supports 1 message, 64 bit
    cap 10[70] = PCI-Express 2 endpoint IRQ 2 max data 128(256) link x1(x1)
    cap 11[b0] = MSI-X supports 4 messages in map 0x20 enabled
    cap 03[d0] = VPD
    ecap 0001[100] = AER 1 0 fatal 0 non-fatal 2 corrected
    ecap 0002[140] = VC 1 max VC0
    ecap 0003[160] = Serial 1 3e000000684ce000
Long output.
2011-10-31: FreeNAS build (from web gui) is: FreeNAS-8.0.2-RELEASE-amd64 (8288).
2011-10-31: Here is what FreeNAS looks like from the command line:
tingo@kg-v2$ ssh tingo@10.1.161.16
tingo@10.1.161.16's password:
Last login: Mon Oct 31 18:29:37 2011 from 10.1.150.17
Copyright (c) 1980, 1983, 1986, 1988, 1990, 1991, 1993, 1994
    The Regents of the University of California. All rights reserved.
FreeBSD 8.2-RELEASE-p3 (FREENAS.amd64) #7: Fri Sep 30 12:51:49 PDT 2011
FreeNAS nanobsd (c) 2009-2010, The FreeNAS Development Team
All rights reserved.
FreeNAS is under the modified BSD license.
For more information, documentation, help or support, go here:
http://freenas.org
Welcome to FreeNAS
and here is the storage space:
$ zpool list
NAME     SIZE   USED  AVAIL  CAP  HEALTH  ALTROOT
zstore  5.44T  5.26G  5.43T   0%  ONLINE  /mnt
$ zpool status
  pool: zstore
 state: ONLINE
 scrub: none requested
config:
    NAME        STATE   READ WRITE CKSUM
    zstore      ONLINE     0     0     0
      raidz1    ONLINE     0     0     0
        ada0p2  ONLINE     0     0     0
        ada1p2  ONLINE     0     0     0
        ada2p2  ONLINE     0     0     0
        ada3p2  ONLINE     0     0     0
        ada4p2  ONLINE     0     0     0
        ada5p2  ONLINE     0     0     0
errors: No known data errors
mount info:
$ mount
/dev/ufs/FreeNASs1a on / (ufs, local, read-only, soft-updates)
devfs on /dev (devfs, local, multilabel)
/dev/md0 on /etc (ufs, local)
/dev/md1 on /mnt (ufs, local)
/dev/md2 on /var (ufs, local)
/dev/ufs/FreeNASs4 on /data (ufs, local, noatime, soft-updates)
zstore on /mnt/zstore (zfs, local)
zstore/home-tingo on /mnt/zstore/home-tingo (zfs, local)
current free space info:
$ df -h
Filesystem           Size  Used  Avail  Capacity  Mounted on
/dev/ufs/FreeNASs1a  927M  429M   424M       50%  /
devfs                1.0K  1.0K     0B      100%  /dev
/dev/md0             4.3M  3.6M   392K       90%  /etc
/dev/md1             732K   16K   660K        2%  /mnt
/dev/md2              75M   16M    53M       23%  /var
/dev/ufs/FreeNASs4    20M  362K    18M        2%  /data
zstore               4.4T   37K   4.4T        0%  /mnt/zstore
zstore/home-tingo    4.4T  5.9G   4.4T        0%  /mnt/zstore/home-tingo
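The pool size from zpool list (5.44T) and the filesystem size from df (4.4T) differ because zpool list counts the raw capacity of a raidz vdev, parity included, while df shows what the filesystems can use. A rough Python estimate, assuming one disk's worth of parity for raidz1 and ignoring ZFS metadata and reserved space (so it is an upper bound, not an exact figure; the function name is made up for illustration):

```python
def raidz_usable(raw_size, disks, parity=1):
    """Rough usable capacity of a raidz vdev: raw size minus the parity share."""
    return raw_size * (disks - parity) / disks


raw_size_t = 5.44  # SIZE column from 'zpool list'
disks = 6          # ada0p2 .. ada5p2 in the raidz1 vdev

print(f"estimated usable: {raidz_usable(raw_size_t, disks):.2f}T")
```

That gives roughly 4.5T, in the same range as the 4.4T that df reports; the rest presumably goes to ZFS overhead.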
What else?
2011-10-31: more configuring in FreeNAS. I set up a zfs volume (zstore) which is mounted on /mnt/zstore, and created a zfs dataset home-tingo, which is mounted on /mnt/zstore/home-tingo. Not ideal. I created a user, and gave him access rights to the home-tingo dataset. I enabled ssh from services, and tried logging in to the FreeNAS box. It works.
2011-10-31: booting FreeNAS - I inserted the memory stick in the machine and rebooted. The machine booted automatically from the stick, the boot took some time, but finally the FreeNAS 'Console setup' menu was displayed, and a url to access the machine. Web access works.
2011-10-31: I wrote the FreeNAS 8.0.2-RELEASE image to a Verbatim Store n go usb memory stick (4 GB), using my FreeBSD workstation and the command:
# xzcat FreeNAS-8.0.2-RELEASE-amd64-Full_Install.xz | dd of=/dev/ad4 bs=64k
That was all.