AHF upgrade doc v1.0

 # mkdir -p /opt/ahf_media

# cd /opt/ahf_media

# unzip AHF-LINUX_v21.4.3.zip

[root@ccnbdred01vm01 ahf_media]# ll

total 883508

-rw-r--r-- 1 root root 448081998 Aug 24 11:34 AHF-LINUX_v21.4.3.zip

-r-xr-xr-x 1 root root 456613102 Apr 21 22:33 ahf_setup

-rw-r--r-- 1 root root       384 Apr 21 22:34 ahf_setup.dat

-rw-r--r-- 1 root root       625 Apr 21 22:34 oracle-tfa.pub

-rw-r--r-- 1 root root      1525 Apr 21 22:34 README.txt

[root@ccnbdred01vm01 ahf_media]#

[root@ccnbdred01vm01 ahf_media]#

[root@ccnbdred01vm01 ahf_media]# ./ahf_setup


AHF Installer for Platform Linux Architecture x86_64


AHF Installation Log : /tmp/ahf_install_214300_55626_2022_08_24-11_46_02.log


Starting Autonomous Health Framework (AHF) Installation


AHF Version: 21.4.3 Build Date: 202203041749


AHF is already installed at /opt/oracle.ahf


Installed AHF Version: 21.4.1 Build Date: 202201112133


Do you want to upgrade AHF [Y]|N : Y


AHF will also be installed/upgraded on these Cluster Nodes :


1. ccnbdred02vm01


The AHF Location and AHF Data Directory must exist on the above nodes

AHF Location : /opt/oracle.ahf

AHF Data Directory : /u01/app/grid/oracle.ahf/data


Do you want to install/upgrade AHF on Cluster Nodes ? [Y]|N : Y


Upgrading /opt/oracle.ahf


Shutting down AHF Services

TFA-00201 Diagnostic directory not found.

Shutting down TFA

Removed symlink /etc/systemd/system/multi-user.target.wants/oracle-tfa.service.

Removed symlink /etc/systemd/system/graphical.target.wants/oracle-tfa.service.

Successfully shutdown TFA..

Telemetry adapter is not running


Starting AHF Services

Starting TFA..

Created symlink from /etc/systemd/system/multi-user.target.wants/oracle-tfa.service to /etc/systemd/system/oracle-tfa.service.

Created symlink from /etc/systemd/system/graphical.target.wants/oracle-tfa.service to /etc/systemd/system/oracle-tfa.service.

Waiting up to 100 seconds for TFA to be started..

. . . . .

Successfully started TFA Process..

. . . . .

TFA Started and listening for commands

TFA-00201 Diagnostic directory not found.



INFO: Starting exachk scheduler in background. Details for the process can be found at /u01/app/grid/oracle.ahf/data/ccnbdred01vm01/diag/exachk/compliance_start_240822_114735.log



AHF upgrade completed on ccnbdred01vm01


Upgrading AHF on Remote Nodes :


AHF will be installed on ccnbdred02vm01, Please wait.


Please Enter the password for ccnbdred02vm01 :


Is password same for all the nodes? [Y]|N : Y


Upgrading AHF on ccnbdred02vm01 :


[ccnbdred02vm01] Copying AHF Installer


[ccnbdred02vm01] Running AHF Installer


Do you want AHF to store your My Oracle Support Credentials for Automatic Upload ? Y|[N] : N


AHF is successfully upgraded to latest version


.----------------------------------------------------------------------.

| Host           | TFA Version | TFA Build ID         | Upgrade Status |

+----------------+-------------+----------------------+----------------+

| ccnbdred01vm01 |  21.4.3.0.0 | 21430020220304174951 | UPGRADED       |

| ccnbdred02vm01 |  21.4.3.0.0 | 21430020220304174951 | UPGRADED       |

'----------------+-------------+----------------------+----------------'


Moving /tmp/ahf_install_214300_55626_2022_08_24-11_46_02.log to /u01/app/grid/oracle.ahf/data/ccnbdred01vm01/diag/ahf/


[root@ccnbdred01vm01 ahf_media]#

===============================================


# /opt/oracle.ahf

# cd bin


[root@ccnbdred01vm01 bin]# ./ahfctl -help

Usage : ahfctl <command> [options]

        | applypatch

        | applyupdate

        | celldiagcollect

        | checkpassword

        | checkupload

        | compliance

        | deletebackup

        | get

        | getosgroup

        | getresourcelimit

        | getupgrade

        | getupload

        | import

        | loadpolicy

        | printresourcestats

        | querypatch

        | queryupdate

        | redact

        | rmap

        | rollbackpatch

        | rollbackupdate

        | set

        | setosgroup

        | setpassword

        | setresourcelimit

        | setupgrade

        | setupload

        | showrepo

        | startahf

        | statusahf

        | stopahf

        | uninstall

        | unloadpolicy

        | unset

        | unsetpassword

        | unsetresourcelimit

        | unsetupgrade

        | unsetupload

        | upgrade

        | upload

        | version


For detailed help on each command use: ahfctl <command> -h


[root@ccnbdred01vm01 bin]#

================================================================


# ./ahfctl compliance -a


[root@ccnbdred01vm01 bin]# ./ahfctl showrepo


.-----------------------------------------------------.

|                    ccnbdred02vm01                   |

+----------------------+------------------------------+

| Repository Parameter | Value                        |

+----------------------+------------------------------+

| Location             | /u01/app/grid/tfa/repository |

| Maximum Size (MB)    | 10210                        |

| Current Size (MB)    | 0                            |

| Free Size (MB)       | 10210                        |

| Status               | OPEN                         |

'----------------------+------------------------------'


.-----------------------------------------------------.

|                    ccnbdred01vm01                   |

+----------------------+------------------------------+

| Repository Parameter | Value                        |

+----------------------+------------------------------+

| Location             | /u01/app/grid/tfa/repository |

| Maximum Size (MB)    | 10205                        |

| Current Size (MB)    | 0                            |

| Free Size (MB)       | 10205                        |

| Status               | OPEN                         |

'----------------------+------------------------------'


exachk repository: /u01/app/grid/oracle.ahf/data/ccnbdred01vm01/exachk/user_root/output

================================================================

[root@ccnbdred01vm01 bin]# ./ahfctl statusahf



.------------------------------------------------------------------------------------------------------.

| Host           | Status of TFA | PID   | Port | Version    | Build ID             | Inventory Status |

+----------------+---------------+-------+------+------------+----------------------+------------------+

| ccnbdred01vm01 | RUNNING       | 62521 | 5000 | 21.4.3.0.0 | 21430020220304174951 | COMPLETE         |

| ccnbdred02vm01 | RUNNING       | 38005 | 5000 | 21.4.3.0.0 | 21430020220304174951 | COMPLETE         |

'----------------+---------------+-------+------+------------+----------------------+------------------'


------------------------------------------------------------


Master node = ccnbdred01vm01


exachk daemon version = 21.4.3


Install location = /opt/oracle.ahf/exachk


Started at = Wed Aug 24 11:47:53 PKT 2022


Scheduler type = TFA Scheduler


Scheduler PID:  62521


------------------------------------------------------------

ID: exachk.autostart_client_exatier1

------------------------------------------------------------

AUTORUN_FLAGS  =  -usediscovery -profile exatier1 -dball -showpass -tag autostart_client_exatier1 -readenvconfig

COLLECTION_RETENTION  =  7

AUTORUN_SCHEDULE  =  3 2 * * 1,2,3,4,5,6

------------------------------------------------------------

------------------------------------------------------------

ID: exachk.autostart_client

------------------------------------------------------------

AUTORUN_FLAGS  =  -usediscovery -tag autostart_client -readenvconfig

COLLECTION_RETENTION  =  14

AUTORUN_SCHEDULE  =  3 3 * * 0

------------------------------------------------------------


Next auto run starts on Aug 25, 2022 02:03:00


ID:exachk.AUTOSTART_CLIENT_EXATIER1


[root@ccnbdred01vm01 bin]#


================================================================

[root@ccnbdred01vm01 bin]# ./tfactl -help

Usage : tfactl <command> [options]


commands : compliance|celldiagcollect|applyupdate|queryupdate|rollbackupdate|applypatch|querypatch|rollbackpatch|startahf|stopahf|statusahf|uninstall|version|showrepo|loadpolicy|unloadpolicy|import|set|get|unset|setupload|getupload|unsetupload|checkupload|upload|setpassword|unsetpassword|checkpassword|upgrade|setupgrade|unsetupgrade|getupgrade|redact|rmap|setosgroup|getosgroup|deletebackup|setresourcelimit|getresourcelimit|unsetresourcelimit|printresourcestats


For detailed help on each command use: tfactl <command> -h



Usage : /opt/oracle.ahf/tfa/bin/tfactl <command> [options]

    commands:diagcollect|collection|analyze|ips|run|start|stop|enable|disable|status|print|access|purge|directory|host|set|cell|toolstatus|uninstall|diagnosetfa|syncnodes|upload|availability|rest|dbcheck|events|search|changes|isa|blackout|rediscover|modifyprofile|refreshconfig|get|version|floodcontrol|queryindex|index|purgeindex|purgeinventory

For detailed help on each command use:

  /opt/oracle.ahf/tfa/bin/tfactl <command> -help


[root@ccnbdred01vm01 bin]#

================================================================


[root@ccnbdred01vm01 bin]# ./tfactl diagcollect


TFA will collect diagnostics for the last 1 hour(s).

Please enter the time of the incident [YYYY-MM-DD HH24:MI:SS], or <RETURN> to collect for the last 1 hour(s). (Q|q to Quit):



Collecting data for the last 1 hours for all components...

Collecting data for all nodes


TFA is using system timezone for collection, All times shown in PKT.


Collection Id : 20220824121150ccnbdred01vm01


Detailed Logging at : /u01/app/grid/tfa/repository/collection_Wed_Aug_24_12_11_53_PKT_2022_node_all/diagcollect_20220824121150_ccnbdred01vm01.log


Collection Failed. Could not read diagcollect log /u01/app/grid/tfa/repository/collection_Wed_Aug_24_12_11_53_PKT_2022_node_all/diagcollect_console_20220824121150_ccnbdred01vm01.log within 60 seconds

[root@ccnbdred01vm01 bin]#


=================================================================

[root@ccnbdred01vm01 bin]# ./exachk



Searching for running databases . . . . .


.  .

List of running databases registered in OCR


1. GGT26

2. None of above


Select databases from list for checking best practices. For multiple databases, select 1 for All or comma separated number like 1,2 etc [1-2][1]. 1

.  .  .  .  .  .



Either Cluster Verification Utility pack (cvupack) does not exist at /opt/oracle.ahf/common/cvu or it is an old or invalid cvupack


Checking Cluster Verification Utility (CVU) version at CRS Home - /u01/app/19.0.0.0/grid


This version of Cluster Verification Utility (CVU) was released on 06-Jan-2020 and it is older than 180 days. It is highly recommended that you download the latest version of CVU from MOS patch 30839369 to ensure the highest level of accuracy of the data contained within the report


Do you want to download latest version of Cluster Verification Utility (CVU) from my oracle support? [y/n] [y] n


Running older version of Cluster Verification Utility (CVU) from CRS Home - /u01/app/19.0.0.0/grid



Starting to run exachk in background on ccnbdred02vm01 using socket


.  .  .  .

.  .


Checking Status of Oracle Software Stack - Clusterware, ASM, RDBMS on ccnbdred01vm01


.  .  . . . .

.  .  . . . .  .  .  .  .  .  .  .

-------------------------------------------------------------------------------------------------------

                                                 Oracle Stack Status

-------------------------------------------------------------------------------------------------------

  Host Name       CRS Installed  RDBMS Installed    CRS UP    ASM UP  RDBMS UP    DB Instance Name

-------------------------------------------------------------------------------------------------------

ccnbdred01vm01             Yes          Yes          Yes      Yes      Yes              GGT261

-------------------------------------------------------------------------------------------------------

.

.  .  .  .  .  .



.

.

.

.


.



*** Checking Best Practice Recommendations ( Pass / Warning / Fail ) ***


.


Collections and audit checks log file is

/u01/app/grid/oracle.ahf/data/ccnbdred01vm01/exachk/user_root/output/exachk_ccnbdred01vm01_GGT26_082422_120222/log/exachk.log


============================================================

         Node name - ccnbdred01vm01

============================================================

. . . . . .


 Collecting - ASM Disk Group for Infrastructure Software and Configuration

 Collecting - ASM Diskgroup Attributes

 Collecting - ASM diskgroup usable free space

 Collecting - ASM initialization parameters

 Collecting - CELL ID Info

 Collecting - Database Parameters for GGT26 database

 Collecting - Database Undocumented Parameters for GGT26 database

 Collecting - CPU Information

 Collecting - Clusterware and RDBMS software version

 Collecting - Compute node PCI bus slot speed for RDMA Network Fabric HCAs

 Collecting - Kernel parameters

 Collecting - Maximum number of semaphore sets on system

 Collecting - Maximum number of semaphores on system

 Collecting - OS Packages

 Collecting - Patches for Grid Infrastructure

 Collecting - Patches for RDBMS Home

 Collecting - RDBMS and GRID software owner UID across cluster

 Collecting - RDBMS patch inventory

 Collecting - number of semaphore operations per semop system call


Starting to run root collections in background


 Collecting - CRS user time zone check

 Collecting - Exadata Critical Issue EX33

 Collecting - Exadata Critical Issue EX57

 Collecting - Clusterware patch inventory

 Collecting - CRS user limits configuration

 Collecting - Collect ksplice fixes [Database Server]

 Collecting - Exadata Critical Issue EX56

 Collecting - Exadata Critical Issue DB09

 Collecting - Exadata critical issue EX50

 Collecting - Exadata Critical Issue EX64

 Collecting - Exadata Critical Issue EX58

 Collecting - Infrastructure Software and Configuration for compute

 Collecting - Exadata critical issue EX55

 Collecting - HCA firmware version on database server

 Collecting - Exadata version on database server

 Collecting - Exadata critical issue EX67

 Collecting - HCA transfer rate on database server

 Collecting - Verify IP routing configuration on database servers

 Collecting - Obtain hardware information

 Collecting - Verify RoCE Interfaces Status [Database Server]

 Collecting - OFED Software version on database server

 Collecting - Verify Quorum disks configuration

 Collecting - Verify Data Network is Separate from Management Network

 Collecting - Operating system and Kernel version on database server

 Collecting - MaxStartups setting in sshd_config

 Collecting - Validate key sysctl.conf parameters on database servers

 Collecting - System Event Log

 Collecting - Oracle monitoring agent and/or OS settings on ADR diagnostic directories

 Collecting - Verify available ksplice fixes are installed [Database Server]

 Collecting - Verify active kernel version matches expected version for installed Exadata Image

 Collecting - Verify TCP Selective Acknowledgement is enabled

 Collecting - Verify database server file systems have Maximum mount count = -1

 Collecting - Verify the storage servers in use configuration matches across the cluster

 Collecting - Verify imageinfo on database server to compare systemwide

 Collecting - Verify installed rpm(s) kernel type match the active kernel version

 Collecting - Verify no database server kernel out of memory errors

 Collecting - Verify imageinfo on database server

 Collecting - Verify proper ACFS drivers are installed for Spectre v2 mitigation

 Collecting - Verify the Name Service Cache Daemon (NSCD) configuration

 Collecting - Verify the vm.min_free_kbytes configuration

 Collecting - collect time server data [Database Server]

 Collecting - root time zone check


Data collections completed. Checking best practices on ccnbdred01vm01.

------------------------------------------------------------


 INFO =>     Oracle GoldenGate failure prevention best practices

 INFO =>     One or more non-default AWR baselines should be created for GGT26

 FAIL =>     Automatic Storage Management Cluster File System (ACFS) filesystems and critical database files are in the same ASM disk group

 INFO =>     Please refer to data and guidance provided for database parameter processes for GGT26

 WARNING =>  Database parameter DB_BLOCK_CHECKING on primary is not set to the recommended value. for GGT26

 FAIL =>     Flashback on primary is not configured for GGT26

 INFO =>     Operational Best Practices

 INFO =>     Database Consolidation Best Practices

 INFO =>     Computer failure prevention best practices

 INFO =>     Data corruption prevention best practices

 INFO =>     Logical corruption prevention best practices

 INFO =>     Database/Cluster/Site failure prevention best practices

 INFO =>     Client failover operational best practices

 FAIL =>     v$asm_disk os_mb and total_mb values are not the same for one or more disks

 INFO =>     Database failure prevention best practices

 FAIL =>     Primary database is not protected with Data Guard (standby database) for real-time data protection and availability for GGT26

 INFO =>     Storage failures prevention best practices

 INFO =>     Software maintenance best practices

 FAIL =>     FRA space management problem file types are present without an RMAN backup completion within the last 7 days for GGT26

 INFO =>     Oracle recovery manager(rman) best practices

 INFO =>     Database feature usage statistics for GGT26


Copying results from ccnbdred02vm01 and generating report. This might take a while. Be patient.


============================================================

         Node name - ccnbdred02vm01

============================================================

. . . . . .


 Collecting - ASM Disk Group for Infrastructure Software and Configuration

 Collecting - ASM Diskgroup Attributes

 Collecting - ASM diskgroup usable free space

 Collecting - ASM initialization parameters

 Collecting - CELL ID Info

 Collecting - Database Parameters for GGT26 database

 Collecting - Database Undocumented Parameters for GGT26 database

 Collecting - CPU Information

 Collecting - Clusterware and RDBMS software version

 Collecting - Compute node PCI bus slot speed for RDMA Network Fabric HCAs

 Collecting - Kernel parameters

 Collecting - Maximum number of semaphore sets on system

 Collecting - Maximum number of semaphores on system

 Collecting - OS Packages

 Collecting - Patches for Grid Infrastructure

 Collecting - Patches for RDBMS Home

 Collecting - RDBMS and GRID software owner UID across cluster

 Collecting - RDBMS patch inventory

 Collecting - number of semaphore operations per semop system call


Starting to run root collections in background


 Collecting - Exadata Critical Issue EX57

 Collecting - Exadata Critical Issue DB09

 Collecting - Exadata Critical Issue EX58

 Collecting - CRS user limits configuration

 Collecting - Clusterware patch inventory

 Collecting - Collect ksplice fixes [Database Server]

 Collecting - Exadata Critical Issue EX33

 Collecting - Exadata Critical Issue EX56

 Collecting - CRS user time zone check

 Collecting - Exadata critical issue EX55

 Collecting - Exadata Critical Issue EX64

 Collecting - Exadata critical issue EX50

 Collecting - OFED Software version on database server

 Collecting - Exadata version on database server

 Collecting - HCA transfer rate on database server

 Collecting - Infrastructure Software and Configuration for compute

 Collecting - Exadata critical issue EX67

 Collecting - MaxStartups setting in sshd_config

 Collecting - HCA firmware version on database server

 Collecting - Verify Quorum disks configuration

 Collecting - Verify TCP Selective Acknowledgement is enabled

 Collecting - Verify Data Network is Separate from Management Network

 Collecting - Validate key sysctl.conf parameters on database servers

 Collecting - Verify active kernel version matches expected version for installed Exadata Image

 Collecting - Verify RoCE Interfaces Status [Database Server]

 Collecting - Operating system and Kernel version on database server

 Collecting - System Event Log

 Collecting - Verify database server file systems have Maximum mount count = -1

 Collecting - Verify IP routing configuration on database servers

 Collecting - Oracle monitoring agent and/or OS settings on ADR diagnostic directories

 Collecting - Obtain hardware information

 Collecting - Verify available ksplice fixes are installed [Database Server]

 Collecting - Verify installed rpm(s) kernel type match the active kernel version

 Collecting - Verify the storage servers in use configuration matches across the cluster

 Collecting - Verify no database server kernel out of memory errors

 Collecting - Verify imageinfo on database server

 Collecting - Verify the Name Service Cache Daemon (NSCD) configuration

 Collecting - collect time server data [Database Server]

 Collecting - Verify the vm.min_free_kbytes configuration

 Collecting - Verify imageinfo on database server to compare systemwide

 Collecting - root time zone check

 Collecting - Verify proper ACFS drivers are installed for Spectre v2 mitigation


Data collections completed. Checking best practices on ccnbdred02vm01.

------------------------------------------------------------


 INFO =>     Oracle GoldenGate failure prevention best practices

 INFO =>     Please refer to data and guidance provided for database parameter processes for GGT26

 WARNING =>  Database parameter DB_BLOCK_CHECKING on primary is not set to the recommended value. for GGT26


------------------------------------------------------------

                      CLUSTERWIDE CHECKS

------------------------------------------------------------


------------------------------------------------------------

Detailed report (html) -  /u01/app/grid/oracle.ahf/data/ccnbdred01vm01/exachk/user_root/output/exachk_ccnbdred01vm01_GGT26_082422_120222_2123XCA014/exachk_ccnbdred01vm01_GGT26_082422_120222_2123XCA014.html



UPLOAD [if required] - /u01/app/grid/oracle.ahf/data/ccnbdred01vm01/exachk/user_root/output/exachk_ccnbdred01vm01_GGT26_082422_120222_2123XCA014.zip


[root@ccnbdred01vm01 bin]#




Post a Comment

0 Comments