[FIX] immunize shutil.rmtree to node non-existence for remove_node_di… #3148

dPys · 2020-01-06T19:04:05Z

@effigies -- since it seems you're still actively working on the 1.4.x merge, I just went ahead and opened a fresh PR directly onto that branch since rebasing was causing issues...

dPys · 2020-01-06T19:04:56Z

;-)

effigies · 2020-01-06T19:10:30Z

Thanks. Do you think you could put together a small regression test?

codecov · 2020-01-06T21:20:26Z

Codecov Report

Merging #3148 into maint/1.4.x will increase coverage by <.01%.
The diff coverage is n/a.

@@              Coverage Diff               @@
##           maint/1.4.x   #3148      +/-   ##
==============================================
+ Coverage        67.59%   67.6%   +<.01%     
==============================================
  Files              299     299              
  Lines            39499   39499              
  Branches          5220    5220              
==============================================
+ Hits             26700   26703       +3     
+ Misses           12086   12081       -5     
- Partials           713     715       +2

| Flag | Coverage Δ |
|---|---|
| #smoketests | `52.82% <ø> (-0.22%)` ⬇️ |
| #unittests | `64.87% <ø> (+0.02%)` ⬆️ |

| Impacted Files | Coverage Δ |
|---|---|
| nipype/conftest.py | `95.65% <ø> (ø)` ⬆️ |
| nipype/utils/provenance.py | `82.58% <0%> (-1.3%)` ⬇️ |
| nipype/pipeline/engine/nodes.py | `83.33% <0%> (-0.17%)` ⬇️ |
| nipype/pipeline/engine/utils.py | `80.61% <0%> (+0.11%)` ⬆️ |
| nipype/pipeline/plugins/base.py | `61.91% <0%> (+1.91%)` ⬆️ |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 7dd3b37...b473b6b.

dPys · 2020-01-06T22:04:05Z

@effigies, I'm not really sure what you have in mind for a regression test, but let me know if this is approaching it:

import pytest
from pathlib import Path


@pytest.mark.regression
@pytest.mark.parametrize(
    'remove_nodes', [pytest.param('false', marks=pytest.mark.xfail), 'true']
)
def test_remove_nodes(tmpdir, remove_nodes):
    import nipype.interfaces.utility as niu
    import nipype.pipeline.engine as pe

    wf = pe.Workflow(name='test')

    def func(arg1):
        # The exception is raised and immediately swallowed, so the node never crashes.
        try:
            if arg1 == 2:
                raise Exception('arg cannot be ' + str(arg1))
        except:
            pass
        return arg1

    funkynode = pe.MapNode(niu.Function(function=func, input_names=['arg1'],
                                        output_names=['out']),
                           iterfield=['arg1'],
                           name='functor')
    funkynode.inputs.arg1 = [1, 2]

    wf.add_nodes([funkynode])
    wf.base_dir = str(tmpdir)
    wf.config['execution']['remove_node_directories'] = remove_nodes
    wf.config['execution']['stop_on_first_crash'] = 'false'
    wf.config['logging']['crashdump_dir'] = str(tmpdir)
    wf.run(plugin='MultiProc')

    # With remove_node_directories enabled, the node directory should be gone.
    assert not (Path(str(tmpdir)) / 'test' / 'functor').is_dir()

effigies · 2020-01-06T22:33:46Z

Yup, something of the sort. You'll want to make sure that the test fails before your fix and passes after it.

You can use the tmp_path pytest fixture to avoid polluting /tmp, but if you're not very familiar with pytest, go ahead and code up a test, and I can make suggestions to clean it up.
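For anyone unfamiliar with the fixture, here is a minimal sketch of how `tmp_path` is typically used. The test name and the `test/functor` layout are hypothetical and only mirror the draft above; this is not the test that ended up in the PR.

```python
import tempfile
from pathlib import Path


def test_writes_stay_in_tmp_path(tmp_path):
    # Under pytest, `tmp_path` is a pathlib.Path pointing at a fresh
    # per-test directory, so nothing is written to a shared location.
    node_dir = tmp_path / "test" / "functor"  # hypothetical layout
    node_dir.mkdir(parents=True)
    (node_dir / "result.txt").write_text("ok")
    assert node_dir.is_dir()


# Outside pytest, the same function can be exercised with any directory.
scratch = Path(tempfile.mkdtemp())
test_writes_stay_in_tmp_path(scratch)
```

Because `tmp_path` is already a `pathlib.Path`, the `/` operator works on it directly without any conversion.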

dPys · 2020-01-06T23:40:37Z

> Yup, something of the sort. You'll want to make sure that the test fails before your fix and passes after it.
>
> You can use the tmp_path pytest fixture to avoid polluting /tmp, but if you're not very familiar with pytest, go ahead and code up a test, and I can make suggestions to clean it up.

Aside from the regression aspect, the update above should be a bit closer to what we want. I'm not sure exactly how you want to approach the regression: across previous nipype versions, or by overriding the module with a parametrized fixture for rmtree that adds the extra flag?

effigies · 2020-01-06T23:53:36Z

No need to parameterize. Just test on master and on this branch. It should fail and pass, respectively.

dPys · 2020-01-07T00:47:22Z

Removed the parametrization. Tested on maint/1.4.x and it passes. It does not yet fail on master as expected, so we might still need to tweak the test by adding more nodes to the workflow (i.e. nodes after the first node that crashes, so that the workflow keeps running and attempts to remove node directories that were never created)...

effigies · 2020-01-07T01:22:08Z

Perhaps you could explain more how you were running into this issue? Were you running multiple workflows in parallel?

dPys · 2020-01-07T02:37:07Z

> Perhaps you could explain more how you were running into this issue? Were you running multiple workflows in parallel?

Hmm, it occurred repeatedly when running a single workflow (with nested workflows) using MultiProc, and adding ignore_errors=True eliminated the issue completely. It's hard to say precisely what the failure context is, but the fix was reliable.
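The effect of the flag can be shown with the standard library alone. This is a minimal sketch, not nipype's code: `missing_node_dir` is a hypothetical stand-in for a node directory that was never created.

```python
import shutil
import tempfile
from pathlib import Path

# Hypothetical stand-in for a node directory that does not exist.
base = Path(tempfile.mkdtemp())
missing_node_dir = base / "missing_node_dir"

# Default behavior: removing a non-existent tree raises FileNotFoundError.
try:
    shutil.rmtree(missing_node_dir)
    raised = False
except FileNotFoundError:
    raised = True

# With ignore_errors=True the same call returns without raising.
shutil.rmtree(missing_node_dir, ignore_errors=True)

shutil.rmtree(base)  # clean up the temporary parent directory
```

Note that `ignore_errors=True` suppresses every failure during removal, not only the missing-directory case.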

satra · 2020-01-07T02:43:35Z

could this have something to do with the mapnode fix from yesterday?

effigies · 2020-01-07T11:11:21Z

I tried this test on 1.4.0, and it still passes. So whatever it's testing, it's not the error that was being hit.

Checking the coverage, however, it does at least hit the shutil.rmtree line, so that's encouraging. I'm just not sure how the node directory is supposed to get deleted prior to that line. It feels like a race condition, but I also don't see how that's going to happen if jobs aren't being run in parallel.
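If this does turn out to be a race, the usual pattern is sketched below. It is an illustration only, with hypothetical function names, and is not what nipype does: checking for the directory and then deleting it leaves a window in which another process can remove it, whereas attempting the removal and treating "already gone" as success does not.

```python
import shutil
import tempfile
from pathlib import Path


def remove_dir_racy(path):
    # Check-then-act: another process may delete `path` between these two lines.
    if path.exists():
        shutil.rmtree(path)


def remove_dir_tolerant(path):
    # Attempt the removal and treat a missing directory as success.
    # Other failures (e.g. PermissionError) still propagate.
    try:
        shutil.rmtree(path)
        return True
    except FileNotFoundError:
        return False


node_dir = Path(tempfile.mkdtemp())  # hypothetical node directory
first = remove_dir_tolerant(node_dir)   # directory existed and was removed
second = remove_dir_tolerant(node_dir)  # already gone, no exception raised
```

Compared with `ignore_errors=True`, catching only `FileNotFoundError` keeps unrelated failures visible, at the cost of not covering every error a concurrent deletion might produce mid-removal.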

effigies

Some notes on the test. Also, just so you know, we now use the black styler. If you install pre-commit and run it, it will run black for you when you try to make a commit.

effigies · 2020-01-07T11:37:49Z

nipype/pipeline/plugins/tests/test_base.py

+            if arg1 == 2:
+                raise Exception('arg cannot be ' + str(arg1))
+        except:
+            pass


Is the goal to fail or not? If not, we can skip the entire try/except block. If so, we should not catch the exception.
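The two options can be sketched as plain functions. The names `func_passes` and `func_crashes` are hypothetical; the body otherwise follows the draft test.

```python
def func_passes(arg1):
    # Option 1: the node is not meant to fail, so no try/except is needed.
    return arg1


def func_crashes(arg1):
    # Option 2: the node is meant to crash for one input, so the exception
    # is raised and left uncaught.
    if arg1 == 2:
        raise Exception('arg cannot be ' + str(arg1))
    return arg1
```

Either variant makes the intent explicit, whereas raising and then swallowing the exception behaves the same as the first option.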

effigies · 2020-01-07T11:39:43Z

nipype/pipeline/plugins/tests/test_base.py

+        return arg1
+
+    funkynode = pe.MapNode(niu.Function(function=func, input_names=['arg1'],
+                                        output_names=['out']),


This is redundant:

funkynode = pe.MapNode(niu.Function(function=func),

Also, is there something about the problem that is intrinsic to MapNodes? If not, then perhaps just make it a Node, to keep the scope as small as possible.

dPys · 2020-01-07T15:10:03Z

> I tried this test on 1.4.0, and it still passes. So whatever it's testing, it's not the error that was being hit.
>
> Checking the coverage, however, it does at least hit the shutil.rmtree line, so that's encouraging. I'm just not sure how the node directory is supposed to get deleted prior to that line. It feels like a race condition, but I also don't see how that's going to happen if jobs aren't being run in parallel.

It does feel very much like a race condition.

dPys · 2020-01-07T15:26:39Z

@effigies -- I'm going to take another look at this on Friday to see if I can pinpoint exactly what's going on. Things go a little 'nuts' when the iterable expansion starts hitting tens of thousands of threads...

Stay tuned.

effigies · 2020-11-21T14:13:25Z

Let's rebase and reopen if this becomes an issue again.

[FIX] immunize shutil.rmtree to node non-existence for remove_node_di…

c451b69

…rectories=True in the case that stop_on_first_crash=False *Add regression test

dPys force-pushed the maint/1.4.x branch from 7a0f48d to c451b69 on January 6, 2020 23:59

Add reg test

b6474db

effigies mentioned this pull request Jan 7, 2020

[FIX] immunize shutil.rmtree to node non-existence for remove_node_di… #3135

Closed

1 task

effigies reviewed Jan 7, 2020

dPys and others added 3 commits January 7, 2020 09:08

Update nipype/pipeline/plugins/tests/test_base.py

a6f03b5

Co-Authored-By: Chris Markiewicz <effigies@gmail.com>

Update nipype/pipeline/plugins/tests/test_base.py

94000dd

Co-Authored-By: Chris Markiewicz <effigies@gmail.com>

Update nipype/pipeline/plugins/tests/test_base.py

b473b6b

Co-Authored-By: Chris Markiewicz <effigies@gmail.com>

effigies closed this Nov 21, 2020
