MAPREDUCE-7282. Move away from V2 commit algorithm #2349

steveloughran · 2020-09-29T16:58:50Z

Default to v1 commit algorithm
log at WARN on job set up when v2 is used
use a special log for that so people who know what they are doing can
tell it to be quiet

Supercedes #2320

* Default to v1 commit algorithm * log at WARN on job set up when v2 is used * use a special log for that so people who know what they are doing can tell it to be quiet Change-Id: I9922cb6b86997e027870c6a445b715eb3ff5a39e

jbrennan333

Thanks for the updated PR @steveloughran!

...ce-client-core/src/main/java/org/apache/hadoop/mapreduce/lib/output/FileOutputCommitter.java

jbrennan333 · 2020-09-29T18:38:59Z

...ce-client-core/src/main/java/org/apache/hadoop/mapreduce/lib/output/FileOutputCommitter.java

+      log.warn("The v2 commit algorithm is deprecated;"
+          + " please switch to the v1 algorithm");


I don't think we should use the word deprecated. That implies that this algorithm will be removed in a future release

what do you suggest?

switching to your text

jbrennan333 · 2020-09-29T18:42:39Z

...t/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources/mapred-default.xml

+  The version 2 algorithm is deprecated and no longer the default
+  as task commits were not atomic.


Similarly, remove "deprecated".

jbrennan333 · 2020-09-29T18:45:02Z

...t/hadoop-mapreduce-client/hadoop-mapreduce-client-core/src/main/resources/mapred-default.xml

+  If a first task attempt fails part-way
+  through its task commit, the output directory could end up
+  with data from that failed commit, alongside the data
+  from any subsequent attempts.
+
+  See https://issues.apache.org/jira/browse/MAPREDUCE-7282
+
+  Although no-longer the default, this algorithm is safe to use if
+  all task attempts for a single task meet the following requirements
+  -they generate exactly the same set of files
+  -the contents of each file are exactly the same in each task attempt
+
+  That is:
+  1. If a second attempt commits work, there will be no leftover files from
+  a first attempt which failed during its task commit.
+  2. If a network partition causes the first task attempt to overwrite
+  some/all of the output of a second attempt, the result will be
+  exactly the same as if it had not done so.
+
+  To avoid the warning message on job setup, set the log level of the log
+  org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter.Algorithm
+  to ERROR.


I think this section should be moved to the end of the Algorithm 2 section below. You can add (see below for details) to the end of the line that says why algorithm v2 in no longer the default.

steveloughran · 2020-10-01T13:32:22Z

@jbrennan333 what do you think we should say instead of deprecated? "not recommended".

I was thinking of adding a link to the JIRA and changing the issue text there to clarify

safe if names and content of generated output files is consistent across all task attempts
unsafe if different TAs generate bad files (biggest risk, as partial failure of 1st attempt may leave)
unsafe if different TAs generate different content in same files (only an issue on a network partition and TA MAPREDUCE-6096.SummarizedJob Class Improvment #1 generates output as/after TA YARN-1964 Launching containers from docker #2 does its work.

cleanup of job will delete the whole job attempt dir so that's the maximum time that a partitioned TA may commit work. There's no risk of some VM pausing for 3 hours, restarting and an in progress TA completing its work and overwriting the final output. This is good.

steveloughran · 2020-10-01T14:09:04Z

(Yetus failure is from no new tests)

jbrennan333 · 2020-10-01T14:31:42Z

@steveloughran It's hard to think of a terse warning for this. I think your comment above gets close. Maybe something like "The v2 commit algorithm assumes that the content of generated output files is consistent across all task attempts - if this is not true for this job, the v1 commit algorithm is strongly recommended."

Change-Id: I2a069cc633e24c559311d69b5c0064aaa88a4f3b

jbrennan333

Thanks for the update @steveloughran. The code changes look good to me, but we need to resolve whether the default should be actually be changed - see @daryn-sharp's -1 in the Jira.

jiwq

LGTM Thanks @steveloughran for the work. But I'm not sure whether we should discuss in the mail list.

steveloughran · 2021-06-10T20:58:37Z

Once the manifest committer #2971 is in, the abfs and gcs stores will get something faster than v1 but with its task failure semantics

all treewalking for task attempt listing will be in task commit
no dir renames in task commit, just saving of the manifest
job commit: parallel load of manifests, merge of list of directories to create, parallel set of mkdirs and then the parallelized renames.

In this world, we can just leave people using v1/v2 alone, and for (spark) jobs in azure and google cloud say "use the manifest committer".
Which means I can just close this as a wontfix.

MAPREDUCE-7282. Move away from V2 commit algorithm

f5b5055

* Default to v1 commit algorithm * log at WARN on job set up when v2 is used * use a special log for that so people who know what they are doing can tell it to be quiet Change-Id: I9922cb6b86997e027870c6a445b715eb3ff5a39e

jbrennan333 requested changes Sep 29, 2020

View reviewed changes

MAPREDUCE-7282 feedback on message and comments

f13d9b6

Change-Id: I2a069cc633e24c559311d69b5c0064aaa88a4f3b

jbrennan333 reviewed Oct 2, 2020

View reviewed changes

jiwq reviewed Oct 8, 2020

View reviewed changes

steveloughran closed this Jul 5, 2021

steveloughran deleted the BUG/MAPREDUCE-7282-MRv2-warn branch October 15, 2021 19:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

MAPREDUCE-7282. Move away from V2 commit algorithm #2349

MAPREDUCE-7282. Move away from V2 commit algorithm #2349

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

		log.warn("The v2 commit algorithm is deprecated;"
		+ " please switch to the v1 algorithm");

		The version 2 algorithm is deprecated and no longer the default
		as task commits were not atomic.

MAPREDUCE-7282. Move away from V2 commit algorithm #2349

MAPREDUCE-7282. Move away from V2 commit algorithm #2349

Uh oh!

Conversation

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!