• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer navigation

The Geek Diary

  • OS
    • Linux
    • CentOS/RHEL
    • Solaris
    • Oracle Linux
    • VCS
  • Interview Questions
  • Database
    • oracle
    • oracle 12c
    • ASM
    • mysql
    • MariaDB
  • DevOps
    • Docker
    • Shell Scripting
  • Big Data
    • Hadoop
    • Cloudera
    • Hortonworks HDP

FATAL: Error inserting rds_rdma

by admin

The Problem

RDS module is not loading after rebooting the nodes of the cluster. Therefore, the CRS cannot run on any of the nodes. When trying to load the module the following errors are displayed:

# modprobe rds_rdma
FATAL: Error inserting rds_rdma
/lib/modules/2.6.18-274.18.1.0.1.el5/updates/net/rds/rds_rdma.ko): Unknown symbol in module, or unknown parameter (see dmesg)

dmesg output shows the following entries:

rds_rdma: Unknown symbol rds_cong_map_updated
rds_rdma: Unknown symbol rds_conn_drop
rds_rdma: Unknown symbol rds_message_addref
rds_rdma: Unknown symbol rds_trans_unregister
rds_rdma: Unknown symbol rds_info_deregister_func
rds_rdma: Unknown symbol rds_send_get_message
rds_rdma: Unknown symbol rds_for_each_conn_info
rds_rdma: Unknown symbol rds_message_add_rdma_dest_extension
rds_rdma: Unknown symbol rds_wq
rds_rdma: Unknown symbol rds_atomic_send_complete
rds_rdma: Unknown symbol rds_conn_connect_if_down
rds_rdma: Unknown symbol rds_conn_destroy

When this issue arises. The Cluster Synchronization Services (CSS) daemon “ccsd” will not start thus not allowing GI to fully start. The following entries are logged in Cluster Synchronization Services (CSS) daemon trace file “ocssd.trc”

2017-10-25 20:13:23.776120 : SKGFD:922437376: ERROR: -8(OS Error -1 (open,sskgxplp,Invalid protocol requested (2) or protocol not loaded.,Error 0)
2017-10-25 20:13:23.776127 : SKGFD:922437376: ERROR: -10(OSS Operation oss_initialize failed with error 4 [Network initialization failed]

The Solution

The issue is caused since the line “install rds /bin/true” which appears in the /etc/modprobe.d/network.conf file works similar to a blacklist of that module, but with higher precedence.

The solution to the problem is to perform any of the following actions, which the main objective is to get rid of the “install rds /bin/true” so that the module can load after every system reboot.

1. Remove the file /etc/modprobe.d/network.conf or move it to another directory like /tmp.

or

2. Commenting out the line in the /etc/modprobe.d/network.conf like in the example bellow

# install rds /bin/true

Then we can proceed just to reboot the system and make sure rds is loaded after reboot, if rds is not loaded then load the module by running:

# modprobe rds_rdma

or

Run the following commands:

# depmod -ae current_kernel_version_running  -------> for example 2.6.18-274.18.1.0.1.el5
# modprobe rds_rdma
# reboot

Once rds module is properly loaded CRS can be started on all nodes of the cluster.

Note: The file name under “/etc/modprobe.d/” for rds module depends on what the user want to use. ie: “network.conf” or “rds.conf”

Filed Under: CentOS/RHEL, CentOS/RHEL 5, CentOS/RHEL 6, oracle

Some more articles you might also be interested in …

  1. DNS and DHCP Setup Example for Oracle Grid Infrastructure GNS
  2. Oracle SQL script to Show current Users and SQL being Executed
  3. lvsd Command Examples in Linux
  4. KVM Virsh Command Examples on CentOS and RHEL
  5. How to see what has changed in Kernel Versions (changelog) in CentOS/RHEL
  6. How to Verify if a Disk/Partition is in Use by Oracle ASM, was used by Oracle ASM or is never used by Oracle ASM
  7. SSHFS (Secure SHell FileSystem) – Securely Mount remote filesystem over ssh
  8. “cannot install the best update candidate for package” – error on running dnf update
  9. “WARNING: Failed to connect to lvmetad. Falling back to device scanning” – error while running LVM commands
  10. Configuring Persistent Storage in CentOS/RHEL 5,6 for Single Path using udev rules

You May Also Like

Primary Sidebar

Recent Posts

  • What are /dev/zero and /dev/null files in Linux
  • grpck command – Remove corrupt or duplicate entries in the /etc/group and /etc/gshadow files.
  • xxd command – Expressed in hexadecimal form
  • sesearch: command not found

© 2022 · The Geek Diary

  • Archives
  • Contact Us
  • Copyright