How can we delete duplicate rows from flat files?

1 Answer

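We can delete duplicate rows from a flat file by passing the rows through a Sorter transformation and selecting its Distinct option. With Distinct selected, the transformation treats every port as part of the sort key and discards duplicate rows, so only one copy of each row is passed downstream.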
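
To illustrate the sort-then-distinct idea outside a mapping tool, here is a minimal Python sketch. It rests on assumptions that are not in the answer above: the flat file is comma-delimited, duplicates are rows that match on every column, and `distinct_rows` is a hypothetical helper name. It shows the general technique and is not how the Sorter transformation is implemented.

```python
import csv
import io


def distinct_rows(lines, delimiter=","):
    """Sort rows on all columns and keep one copy of each distinct row.

    `lines` is any iterable of text lines (an open file, a list, StringIO).
    The comma delimiter is an assumption made for this sketch.
    """
    # Parse every non-empty line into a tuple so rows can be compared and sorted.
    rows = [tuple(r) for r in csv.reader(lines, delimiter=delimiter) if r]

    result = []
    # After sorting on all columns, duplicates sit next to each other,
    # so comparing each row with the last one kept is enough.
    for row in sorted(rows):
        if not result or result[-1] != row:
            result.append(row)
    return result


# Sample data standing in for a flat file (hypothetical contents).
sample = io.StringIO("2,bob\n1,alice\n2,bob\n1,alice\n3,carol\n")
deduped = distinct_rows(sample)
print(deduped)
```

For a real file, the same function could be called as `distinct_rows(open("input.csv", newline=""))`, where `input.csv` is a placeholder path. Note that the output is sorted, so the original row order is not preserved.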

...