What is Balancer in Hadoop?

(@anamika)
Noble Member
Joined: 2 years ago
Posts: 1381
17/03/2021 12:00 pm
What does Balancer mean in Hadoop?

(@abhijith)
Noble Member
Joined: 2 years ago
Posts: 1350
17/03/2021 12:01 pm

In Hadoop, the data stored within HDFS isn't always distributed uniformly across the DataNodes. A common reason for this imbalance is that new DataNodes may be added to an existing cluster, and the existing data isn't automatically redistributed onto them.

When placing new blocks, the NameNode takes several points into consideration to determine which DataNodes will receive them:

  • A policy to keep one replica of a block on the same node as the node that is writing the block.
  • A policy to spread the replicas of a block across racks so that the cluster can survive the loss of an entire rack.
  • One replica is usually placed on the same rack as the node writing it, so that I/O between different racks is reduced.
  • The data has to be spread uniformly across the DataNodes in the cluster.
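
Because these considerations compete with each other, data can still end up unevenly placed. The Balancer is the HDFS tool that administrators run (`hdfs balancer`, with an optional `-threshold` percentage) to analyze block placement and move blocks between DataNodes until things even out. Roughly, a cluster counts as balanced when every DataNode's utilization is within the threshold of the overall cluster utilization. Here is a small Python sketch of just that check. The function name, node names, and numbers are made up for illustration, and this is not the Balancer's actual code:

```python
def classify_datanodes(nodes, threshold=10.0):
    """Label each DataNode relative to overall cluster utilization.

    nodes: dict mapping a (hypothetical) node name to (used, capacity).
    threshold: allowed deviation in percentage points.
    """
    total_used = sum(used for used, _ in nodes.values())
    total_capacity = sum(cap for _, cap in nodes.values())
    cluster_pct = 100.0 * total_used / total_capacity

    labels = {}
    for name, (used, cap) in nodes.items():
        node_pct = 100.0 * used / cap
        if node_pct > cluster_pct + threshold:
            labels[name] = "over-utilized"
        elif node_pct < cluster_pct - threshold:
            labels[name] = "under-utilized"
        else:
            labels[name] = "balanced"
    return cluster_pct, labels


# Illustrative numbers only: dn3 was just added and holds no data yet.
sample = {"dn1": (80, 100), "dn2": (70, 100), "dn3": (0, 100)}
cluster_pct, labels = classify_datanodes(sample)
for name, label in labels.items():
    print(name, label)
```

In this made-up cluster the freshly added dn3 shows up as under-utilized while the older nodes are over-utilized, which is the situation where running the Balancer helps.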
