
What is a task instance in Hadoop and where does it run?

1 Answer


A task instance is a single map or reduce task of a MapReduce job, and it runs on a slave node. The TaskTracker does not execute the task itself. It starts a separate JVM process (the task instance) to do the actual work, so that a failure in the task does not take down the TaskTracker. Each task instance runs in its own JVM process. Several task instances can run on one slave node at the same time, up to the number of slots configured on that node's TaskTracker. By default, a new JVM process is spawned for each task.
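The isolation idea can be illustrated with a small sketch. This is not Hadoop code: it is a Python stand-in in which child Python processes play the role of the per-task JVMs and the parent plays the role of the TaskTracker. The names `run_tasks`, `TASK_SOURCE`, and the `slots` parameter are hypothetical, and running tasks in fixed batches is a simplification of how slots are really scheduled.

```python
import subprocess
import sys

# Hypothetical stand-in for a task: a tiny script run in a child interpreter.
# The task named "bad" exits with a non-zero status to simulate a crash.
TASK_SOURCE = """
import sys
name = sys.argv[1]
if name == "bad":
    sys.exit(1)  # simulated task failure
print("done " + name)
"""


def run_tasks(task_names, slots=2):
    """Run each task in its own child process, at most `slots` at a time.

    Returns a dict mapping task name to (exit_code, stdout_text).
    """
    results = {}
    pending = list(task_names)
    while pending:
        # Assumption: tasks are taken in simple batches of `slots`.
        batch, pending = pending[:slots], pending[slots:]
        # One fresh child process per task, mirroring one JVM per task instance.
        children = [
            (
                name,
                subprocess.Popen(
                    [sys.executable, "-c", TASK_SOURCE, name],
                    stdout=subprocess.PIPE,
                    text=True,
                ),
            )
            for name in batch
        ]
        for name, child in children:
            out, _ = child.communicate()
            # A failed child only yields a non-zero exit code here;
            # the parent process keeps running and handles the next task.
            results[name] = (child.returncode, out.strip())
    return results


if __name__ == "__main__":
    for name, (code, out) in run_tasks(["m1", "bad", "m2"]).items():
        print(name, code, out)
```

The failing task only affects its own process: the parent records a non-zero exit code and carries on with the remaining tasks, which is the reason the TaskTracker keeps task execution out of its own JVM.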
