Configuring Rack Awareness on Hadoop Using CentOS

July 9

This post explains rack awareness in Apache Hadoop. HDFS block placement uses rack awareness for fault tolerance: at least one replica of each block is placed on a different rack from the others. This keeps the data available if a network switch fails or the cluster is partitioned. Knowing the rack layout also helps you choose an appropriate replication factor. Configuring rack awareness tells Hadoop which DataNode is on which rack.

Note: For rack awareness configuration, all the changes have to be made on the NameNode (master node) only.

You can configure rack awareness in 3 steps:

1. Create a topology data file (topology.data) anywhere on the master node (NameNode)

Next, list each slave node (DataNode) in topology.data together with the rack it belongs to.
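As a rough illustration, topology.data is usually a plain text file with one DataNode per line, followed by its rack path. The sketch below writes such a file. The IP addresses, the rack names, and the file location are all hypothetical placeholders, so substitute the values for your own cluster.

```shell
#!/usr/bin/env bash
# Hypothetical location; the file can live anywhere on the NameNode.
TOPOLOGY_DATA="${TOPOLOGY_DATA:-$PWD/topology.data}"

# One DataNode per line: <IP or hostname> <rack path>.
# These addresses and rack names are made up for illustration.
cat > "$TOPOLOGY_DATA" <<'EOF'
192.168.56.101 /dc1/rack1
192.168.56.102 /dc1/rack1
192.168.56.103 /dc1/rack2
192.168.56.104 /dc1/rack2
EOF

echo "Wrote rack mappings to $TOPOLOGY_DATA"
```

Use whichever identifier (IP address or hostname) the NameNode actually passes to the script. Listing both forms for each node is a common way to stay safe.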

 

2. Create a topology.sh script file (also called the rack awareness script)
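A minimal sketch of such a script is shown here, not necessarily the exact script used on the original cluster. Hadoop calls the script with one or more DataNode IP addresses or hostnames as arguments and reads one rack path per argument from standard output. The location of topology.data and the helper name resolve_rack are assumptions made for this example.

```shell
#!/usr/bin/env bash
# topology.sh: a sketch of a rack awareness script.

# Assumed location of the mapping file; point this at your own topology.data.
TOPOLOGY_DATA="${TOPOLOGY_DATA:-/home/hadoop/topology.data}"
# Rack reported for nodes that are not listed in the mapping file.
DEFAULT_RACK="/default-rack"

# Print the rack path for one node (hypothetical helper name).
resolve_rack() {
  local node="$1" host rack
  if [ -r "$TOPOLOGY_DATA" ]; then
    # The "|| [ -n "$host" ]" part handles a last line with no trailing newline.
    while read -r host rack _ || [ -n "$host" ]; do
      if [ "$host" = "$node" ] && [ -n "$rack" ]; then
        echo "$rack"
        return 0
      fi
    done < "$TOPOLOGY_DATA"
  fi
  echo "$DEFAULT_RACK"
}

# Emit one rack path per argument, in the order the arguments were given.
for node in "$@"; do
  resolve_rack "$node"
done
```

Make the script executable (for example with chmod +x topology.sh) and try it by hand with a few DataNode addresses before pointing Hadoop at it.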

 

3. Add the property that points to topology.sh to core-site.xml on the master node only
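The fragment below sketches what that property might look like. In Hadoop 2.x and later the property is named net.topology.script.file.name, while Hadoop 1.x used topology.script.file.name. The script path is a placeholder, so replace it with the actual location of your topology.sh.

```xml
<!-- Goes inside the <configuration> element of core-site.xml on the NameNode. -->
<!-- The path in <value> is a placeholder for illustration. -->
<property>
  <name>net.topology.script.file.name</name>
  <value>/home/hadoop/topology.sh</value>
</property>
```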

 

Next, start your cluster. If it is already running, restart the NameNode so that it reads the new setting.

 

Check the Hadoop admin report to see if the cluster is aware of the rack.
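One way to run this check from the NameNode is sketched below, assuming a Hadoop 2.x or later installation with the hdfs command on the PATH (older releases use hadoop dfsadmin instead). The function name show_rack_report is just a label for this example.

```shell
#!/usr/bin/env bash
# Print the admin report and the rack topology as the NameNode sees them.
show_rack_report() {
  hdfs dfsadmin -report         # per-DataNode details; should include the rack for mapped nodes
  hdfs dfsadmin -printTopology  # DataNodes grouped under their rack paths
}

if command -v hdfs >/dev/null 2>&1; then
  show_rack_report
else
  echo "hdfs not found on PATH; run this on the NameNode as the Hadoop user" >&2
fi
```

If every DataNode still appears under /default-rack, the script is probably not being picked up: check the path in core-site.xml and the execute permission on topology.sh.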

 

 

 

 

The cluster is now rack aware!

We hope this post helped you understand how to configure rack awareness in Hadoop.

If you have any questions, feel free to write to us at support@acadgild.com or comment below, and we will get back to you as soon as possible.
