How do you benchmark your Hadoop Cluster with Hadoop tools?

1 Answer

Hadoop distributions ship with several benchmarking tools, including TestDFSIO, NNBench, MRBench, TeraGen, and TeraSort.

To find network bottlenecks and I/O-related performance issues, TestDFSIO can be used to stress-test the cluster.

NNBench can be used to load-test the NameNode by creating, deleting, and renaming files on HDFS.
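A minimal sketch of an NNBench invocation, again only building the command line. The jar name is a placeholder and the map, reduce, and file counts are arbitrary illustration values; the operation names and flags are the ones I know from the NNBench usage message, but verify them against your Hadoop version.

```python
# Hypothetical jar name: the actual path and version suffix vary by distribution.
TESTS_JAR = "hadoop-mapreduce-client-jobclient-tests.jar"

# Operations NNBench runs against the NameNode.
NNBENCH_OPERATIONS = ("create_write", "open_read", "rename", "delete")


def nnbench_cmd(operation, maps=12, reduces=6, number_of_files=1000,
                base_dir="/benchmarks/NNBench", jar=TESTS_JAR):
    """Build the argument list for one NNBench operation (illustrative defaults)."""
    if operation not in NNBENCH_OPERATIONS:
        raise ValueError(f"unsupported operation: {operation}")
    return [
        "hadoop", "jar", jar, "nnbench",
        "-operation", operation,
        "-maps", str(maps),
        "-reduces", str(reduces),
        "-numberOfFiles", str(number_of_files),
        "-baseDir", base_dir,
    ]
```

Run `create_write` first: the read, rename, and delete operations work on the files it creates under the base directory.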

Once the cluster passes the TestDFSIO tests, the TeraSort benchmarking tool can be used to test the configuration. Yahoo used TeraSort to set a record by sorting 1PB of data in 16 hours on a
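For reference, a TeraSort run normally has three steps: TeraGen generates the input, TeraSort sorts it, and TeraValidate checks that the output is ordered. TeraGen takes a row count rather than a byte size, and each row is 100 bytes, so the sketch below converts a target size into rows and builds the three commands. The jar name and HDFS directories are placeholders, not paths from any real cluster.

```python
# Hypothetical jar name: the actual path and version suffix vary by distribution.
EXAMPLES_JAR = "hadoop-mapreduce-examples.jar"

ROW_BYTES = 100  # TeraGen writes fixed-size 100-byte rows


def rows_for_bytes(total_bytes):
    """Number of TeraGen rows needed to produce about total_bytes of input."""
    return total_bytes // ROW_BYTES


def terasort_pipeline(total_bytes, work_dir="/benchmarks/terasort",
                      jar=EXAMPLES_JAR):
    """Return the generate, sort, and validate commands in run order."""
    input_dir = f"{work_dir}/input"     # placeholder HDFS paths
    output_dir = f"{work_dir}/output"
    report_dir = f"{work_dir}/report"
    return [
        ["hadoop", "jar", jar, "teragen",
         str(rows_for_bytes(total_bytes)), input_dir],
        ["hadoop", "jar", jar, "terasort", input_dir, output_dir],
        ["hadoop", "jar", jar, "teravalidate", output_dir, report_dir],
    ]
```

For example, `rows_for_bytes(10**12)` gives 10,000,000,000 rows for roughly 1 TB of input. Start with a much smaller size to confirm the jobs run before scaling up.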
