RAC INTERVIEW QUESTION and ANSWERS:
What are Oracle
Clusterware processes for 10g on Unix and Linux
Cluster Synchronization Services (ocssd)
— Manages cluster node membership and runs as the oracle user; failure of
this process results in cluster restart.
Cluster Ready Services (crsd) — The
crs process manages cluster resources (which could be a database, an instance,
a service, a Listener, a virtual IP (VIP) address, an application process, and
so on) based on the resource's configuration information that is stored in the
OCR. This includes start, stop, monitor and failover operations. This process
runs as the root user
Event manager daemon (evmd) —A
background process that publishes events that crs creates.
Process Monitor Daemon (OPROCD) —This
process monitor the cluster and provide I/O fencing. OPROCD performs its check,
stops running, and if the wake up is beyond the expected time, then OPROCD
resets the processor and reboots the node. An OPROCD failure results in Oracle
Clusterware restarting the node. OPROCD uses the hangcheck timer on Linux
platforms.
RACG (racgmain, racgimon) —Extends
clusterware to support Oracle-specific requirements and complex resources. Runs
server callout scripts when FAN events occur.
What are Oracle database background
processes specific to RAC
•LMS—Global Cache Service Process
•LMD—Global Enqueue Service Daemon
•LMON—Global Enqueue Service Monitor
•LCK0—Instance Enqueue Process
To ensure that each Oracle RAC database instance obtains the block that it
needs to satisfy a query or transaction, Oracle RAC instances use two
processes, the Global Cache Service (GCS) and the Global Enqueue Service (GES).
The GCS and GES maintain records of the statuses of each data file and each
cached block using a Global Resource Directory (GRD). The GRD contents are distributed
across all of the active instances.
What are Oracle Clusterware Components
Voting Disk — Oracle RAC uses the
voting disk to manage cluster membership by way of a health check and
arbitrates cluster ownership among the instances in case of network failures.
The voting disk must reside on shared disk.
Oracle Cluster Registry (OCR) —
Maintains cluster configuration information as well as configuration
information about any cluster database within the cluster. The OCR must reside
on shared disk that is accessible by all of the nodes in your cluster
How do you troubleshoot node reboot
Please check metalink ...
Note 265769.1 Troubleshooting CRS Reboots
Note.559365.1 Using Diagwait as a diagnostic to get more information for
diagnosing Oracle Clusterware Node evictions.
How do you backup the OCR
There is an automatic backup mechanism for OCR. The default location is :
$ORA_CRS_HOME\cdata\"clustername"\
To display backups :
#ocrconfig -showbackup
To restore a backup :
#ocrconfig -restore
With Oracle RAC 10g Release 2 or later, you can also use the export command:
#ocrconfig -export -s online, and use -import option to restore the contents
back.
With Oracle RAC 11g Release 1, you can do a manaual backup of the OCR with the
command:
# ocrconfig -manualbackup
How do you backup voting disk
#dd if=voting_disk_name of=backup_file_name
How do I identify the voting disk
location
#crsctl query css votedisk
How do I identify the OCR file location
check /var/opt/oracle/ocr.loc or /etc/ocr.loc ( depends upon platform)
or
#ocrcheck
Is ssh required for normal Oracle RAC
operation ?
"ssh" are not required for normal Oracle RAC operation. However
"ssh" should be enabled for Oracle RAC and patchset installation.
What is SCAN?
Single Client Access Name (SCAN) is s a new Oracle Real Application Clusters
(RAC) 11g Release 2 feature that provides a single name for clients to access
an Oracle Database running in a cluster. The benefit is clients using SCAN do
not need to change if you add or remove nodes in the cluster.
Click
here for more details from Oracle
What is the purpose of Private Interconnect
?
Clusterware uses the private interconnect for cluster synchronization (network
heartbeat) and daemon communication between the the clustered nodes. This
communication is based on the TCP protocol.
RAC uses the interconnect for cache fusion (UDP) and inter-process
communication (TCP). Cache Fusion is the remote memory mapping of Oracle
buffers, shared between the caches of participating nodes in the cluster.
Why do we have a Virtual IP (VIP) in Oracle
RAC?
Without using VIPs or FAN, clients connected to a node that died will often
wait for a TCP timeout period (which can be up to 10 min) before getting an
error. As a result, you don't really have a good HA solution without using
VIPs.
When a node fails, the VIP associated with it is automatically failed over to
some other node and new node re-arps the world indicating a new MAC address for
the IP. Subsequent packets sent to the VIP go to the new node, which will send
error RST packets back to the clients. This results in the clients getting
errors immediately.
What do you do if you see GC CR BLOCK LOST
in top 5 Timed Events in AWR Report?
This is most likely due to a fault in interconnect network.
Check netstat -s
if you see "fragments dropped" or "packet reassemblies
failed" , Work with your system administrator find the fault with network.
How many nodes are supported in a RAC
Database?
10g Release 2, support 100 nodes in a cluster using Oracle Clusterware, and 100
instances in a RAC database.
Srvctl cannot start instance, I get the
following error PRKP-1001 CRS-0215, however sqlplus can start it on both nodes?
How do you identify the problem?
Set the environmental variable SRVM_TRACE to true.. And start the instance with
srvctl. Now you will get detailed error stack.
what is the purpose of the ONS daemon?
The Oracle Notification Service (ONS) daemon is an daemon started by the CRS
clusterware as part of the nodeapps. There is one ons daemon started per
clustered node.
The Oracle Notification Service daemon receive a subset of published
clusterware events via the local evmd and racgimon clusterware daemons and
forward those events to application subscribers and to the local listeners.
This in order to facilitate:
a. the FAN or Fast Application Notification feature or allowing applications to
respond to database state changes.
b. the 10gR2 Load Balancing Advisory, the feature that permit load balancing
accross different rac nodes dependent of the load on the different nodes. The
rdbms MMON is creating an advisory for distribution of work every 30seconds and
forward it via racgimon and ONS to listeners and applications.
No comments:
Post a Comment