Intel S4610 1.92TB and 3.84TB SKUs may become unresponsive at 1700 hrs. of cumulative Idle Power On Hours.

今回はタイトルの Bug 対応についてです. vSAN の Capacity 用なので放置していたら...と考えただけでもゾッとします. Fail 後は BIOS からも認識されなくなるらしいです. (TMN 安田さん,情報提供ありがとうございます!)

https://downloadmirror.intel.com/28639/eng/Intel_SSD_Data_Center_Tool_3_0_19_Release_Notes.pdf

以下に,ESXi から Firmware Update した際の顛末を纏めておきました.


1. Intel SSD Data Center Tool の Install

ESXi 版が用意されています.

ダウンロード インテル® SSD データセンター・ツール(インテル® SSD DCT)

Download したら checksum を確認して,解凍します.

[root@fqdn:/path-to-dir] md5sum ./Intel_SSD_Data_Center_Tool_3.0.19_ESXi.zip 
dad044d7f1ed06f5254dcd77009cc554  ./Intel_SSD_Data_Center_Tool_3.0.19_ESXi.zip
[root@fqdn:/path-to-dir] unzip ./Intel_SSD_Data_Center_Tool_3.0.19_ESXi.zip 
Archive:  ./Intel_SSD_Data_Center_Tool_3.0.19_ESXi.zip
  inflating: Intel_SSD_Data_Center_Tool_3.0.19_ESXi/Intel_SSD_Data_Center_Tool_3_0_17_User_Guide-331961-018US.pdf
  inflating: Intel_SSD_Data_Center_Tool_3.0.19_ESXi/Intel_SSD_Data_Center_Tool_3_0_19_Release_Notes-330715-033US.pdf
  inflating: Intel_SSD_Data_Center_Tool_3.0.19_ESXi/Intel_SSD_Data_Center_Tool_Install_Guide_3_x_330713-005US.pdf
  inflating: Intel_SSD_Data_Center_Tool_3.0.19_ESXi/intel_ssd_data_center_tool-3.0.19-400.vib

Host's acceptance level を確認して,CommunitySupported に変更します.

[root@fqdn:/path-to-dir] esxcli software acceptance get
PartnerSupported
[root@fqdn:/path-to-dir] esxcli software acceptance set --level=CommunitySupported
Host acceptance level changed to 'CommunitySupported'.

Vib を Install します.

[root@fqdn:/path-to-dir] esxcli software vib install -v /your-path-to-vib/intel_ssd_data_center_tool-3.0.19-400.vib 
Installation Result
   Message: Operation finished successfully.
   Reboot Required: false
   VIBs Installed: INT_bootbank_intel_ssd_data_center_tool_3.0.19-400
   VIBs Removed: 
   VIBs Skipped:

/opt/intel/isdct ディレクトリが作成され,準備完了です.

[root@fqdn:/opt/intel/isdct] ls -al
total 10516
drwxr-xr-x    1 root     root           512 Apr 23 07:08 .
drwxr-xr-x    1 root     root           512 Apr 23 07:08 ..
drwxr-xr-x    1 root     root           512 Apr 23 07:08 FirmwareModules
-r-xr-xr-x    1 root     root       2472280 Mar 19 16:45 isdct
-r-xr-xr-x    1 root     root       8280240 Mar 19 16:45 libIntel.SSDFeatures.so.2.0.0

2. Firmware Update

早速 Device の確認をしてみると...

[root@fqdn:/opt/intel/isdct] ./isdct show -intelssd

No results

Raid Controller 配下の Device は参照できないようです orz. Blog なので,成功した事だけスマートに纏めたら良いのかもしれませんが,実際の現場はいつも試行錯誤ばかりなので,今回は失敗談も残しておきます(苦笑). Host's acceptance level を戻し,vib を削除して後始末完了.

[root@fqdn:/path-to-dir] esxcli software acceptance set --level=PartnerSupported
Host acceptance level changed to 'PartnerSupported'.
[root@fqdn:/path-to-dir] esxcli software vib remove -n intel_ssd_data_center_tool
Removal Result
   Message: Operation finished successfully.
   Reboot Required: false
   VIBs Installed: 
   VIBs Removed: INT_bootbank_intel_ssd_data_center_tool_3.0.19-400
   VIBs Skipped:

3. Storcli

という訳で,Storcli 経由で実施するしかないですね.気を取り直して Search results for 'storcli latest' から Latest の Storcli を入手して Install します.

[root@fqdn:/path-to-dir] esxcli software vib install --no-sig-check -v /your-path-to-vib/vmware-storcli.vib
Installation Result
   Message: Operation finished successfully.
   Reboot Required: false
   VIBs Installed: LSI_bootbank_vmware-storcli_007.0913.0000.0000-01
   VIBs Removed: 
   VIBs Skipped: 

まず,show コマンドで確認します.

[root@fqdn:~] /opt/lsi/storcli/storcli show
CLI Version = 007.0913.0000.0000 Jan 11, 2019
Operating system = VMkernel 6.7.0
Status Code = 0
Status = Success
Description = None

Number of Controllers = 2
Host Name = fqdn
Operating System  = VMkernel 6.7.0
StoreLib IT Version = 07.0906.0200.0000
StoreLib IR3 Version = 16.04-0

System Overview :
===============

------------------------------------------------------------------------------
Ctl Model             Ports PDs DGs DNOpt VDs VNOpt BBU sPR DS  EHS ASOs Hlth 
------------------------------------------------------------------------------
  0 AVAGO3108MegaRAID     8   2   1     0   1     0 N/A On  1&2 Y      3 Opt  
------------------------------------------------------------------------------


IT System Overview :
==================

-------------------------------------------------------------------------
Ctl Model      AdapterType   VendId DevId SubVendId SubDevId PCI Address 
-------------------------------------------------------------------------
  1 LSI3008-IR   SAS3008(C0) 0x1000  0x97    0x15D9    0x808 00:19:00:00 
-------------------------------------------------------------------------

今回 Firmware Update 対象の S4610 1.92TB は LSI3008 に接続されているので,c1 を指定して Drive を確認します. Enclosure 6 の Slot 2-5 までに挿さっていることが分かります.

[root@fqdn:~] /opt/lsi/storcli/storcli /c1/eall/sall show
(省略)
Drive Information :
=================

----------------------------------------------------------------------------
EID:Slt DID State DG       Size Intf Med SED PI SeSz Model               Sp 
----------------------------------------------------------------------------
6:0       0 UGood -  744.125 GB SAS  SSD N   N  512B HUSMM3280ASS200     U  
6:1       1 UGood -  744.125 GB SAS  SSD N   N  512B HUSMM3280ASS200     U  
6:2       2 UGood -    1.745 TB SATA SSD N   N  512B INTEL SSDSC2KG019T8 U  
6:3       3 UGood -    1.745 TB SATA SSD N   N  512B INTEL SSDSC2KG019T8 U  
6:4       4 UGood -    1.745 TB SATA SSD N   N  512B INTEL SSDSC2KG019T8 U  
6:5       5 UGood -    1.745 TB SATA SSD N   N  512B INTEL SSDSC2KG019T8 U  
----------------------------------------------------------------------------

個別に詳細を確認します.現在の FIrmware Revision は 0100 です.

[root@fqdn:~] /opt/lsi/storcli/storcli /c1/e6/s2 show all
(省略)
Drive /c1/e6/s2 Device attributes :
=================================
Manufacturer Id = ATA     
Model Number = INTEL SSDSC2KG019T8
NAND Vendor = NA
SN = PHYSERIALNUMBER
WWN = NA
Firmware Revision = 0100
(省略)

Intel 社のサイトから入手しておいた Firmware を Drive 毎に適用します.

[root@fqdn:~] /opt/lsi/storcli/storcli /c1/e6/s2 download src=/your-path-to-bin/XCV10110_XBUB0008_signed.bin
Starting microcode update.please wait...
CLI Version = 007.0913.0000.0000 Jan 11, 2019
Operating system = VMkernel 6.7.0
Controller = 1
Status = Success
Description = None

対象の Drive の Firmware を全て Update したら Host を再起動,その後 Firmware Revision を確認します.

[root@fqdn:~] /opt/lsi/storcli/storcli /c1/e6/s2 show all
(省略)
Drive /c1/e6/s2 Device attributes :
=================================
Manufacturer Id = ATA     
Model Number = INTEL SSDSC2KG019T8
NAND Vendor = NA
SN = PHYSERIALNUMBER
WWN = NA
Firmware Revision = 0110
(省略)

これで安心して来週の DTW に行けそうです.