CWL Parsing #22

illusional · 2019-05-28T04:51:18Z

Hey all,

This is the start of my parsing a CWL into the workflow classes, with the goal for it to be almost automatic (with a couple of annotated hints).

Most of the work is in utils, based on my previous Serializable submission.

https://github.com/illusional/python-cwlgen/blob/parsing/cwlgen/utils.py#L69-L177

I make two assumptions:

CWL you parse is valid, there's no sanity checking (you can use CWLTool) for that.
The field names in CWL (and the dict) EXACTLY match the attribute names on the class. It will throw a KeyException if this is not the case.

Essentially, it loops through the keys on the CWL Dictionary and sets that attribute (if it exists) on the required class. You can provide an ordered list of "potential" classes that it might be, and it will do its best to go through until it finds one.

You can see how this works on the Workflow (L41-44) class.

I've swapped out the converter and left the CommandLineTool tests (from @khillion) the same (except for secondaryFiles) and it seems to work okay.

@kellrott, do you mind having a look at my code, as both yourself at @khillion are the people who have worked on workflow parsing, and this should just extend it to Workflow (for now).

…wlgen into parsing

codecov · 2019-05-28T04:53:24Z

Codecov Report

Merging #22 into master will increase coverage by 6.38%.
The diff coverage is 83%.

@@            Coverage Diff             @@
##           master      #22      +/-   ##
==========================================
+ Coverage   76.75%   83.13%   +6.38%     
==========================================
  Files           9       10       +1     
  Lines         684      593      -91     
==========================================
- Hits          525      493      -32     
+ Misses        159      100      -59

Impacted Files	Coverage Δ
cwlgen/commandlinetool.py	`87.34% <100%> (+0.85%)`	⬆️
cwlgen/__init__.py	`100% <100%> (ø)`	⬆️
cwlgen/common.py	`79.43% <100%> (+0.59%)`	⬆️
cwlgen/workflow.py	`91.83% <100%> (+10.35%)`	⬆️
cwlgen/workflowdeps.py	`79.31% <79.31%> (ø)`
cwlgen/utils.py	`82.72% <80.24%> (-6.93%)`	⬇️
cwlgen/import_cwl.py	`94.44% <87.5%> (+28.8%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ca06118...71ff30a. Read the comment docs.

multimeric · 2019-07-05T03:35:09Z

cwlgen/commandlinetool.py

+        :type glob: STRING
+        :param load_contents: For each file matched, read up to the 1st 64 KiB of text and
+                              place it in the contents field
+        :type load_contents: BOOLEAN


I notice this isn't an actual Python type (should be bool). Is this intentional? It doesn't seem ideal

It's a legacy thing that I haven't changed. Might be worth changing in #24

If/once you deprecate Python 2, you can make these all PEP 484 type annotations and then most tooling should work better

multimeric · 2019-07-05T03:37:56Z

cwlgen/commandlinetool.py

@@ -15,13 +16,109 @@
 #  Class(es)  ------------------------------


+class CommandOutputBinding(Serializable):


These classes should probably be dataclasses, no? That would save you from having to define the __init__ methods all the time. Admittedly they require Python 3.6 (via backport) or Python 3.7 (natively). Not sure what versions you're supporting.

Woah that's sick! I didn't know Dataclasses were a thing! When this project drops EOL support for Python 2.7 I reckon this is definitely the way to go.

cwlgen/utils.py

cwlgen/workflowdeps.py

multimeric · 2019-07-05T04:23:16Z

test/test_unit_import_workflow.py

@@ -0,0 +1,111 @@
+#!/usr/bin/env python


Did you think about doing an integration test that does CWL → Python → CWL and comparing the input and output CWL?

I think it would be a good test to improve this translation mechanism. For this change I know it won't produce identical CWL because it's opinionated on the dict / array output of inputs / outputs / steps.

It would be good to functionally validate the CWL tbh.

(not just convert the array of strings to a string "['value']" haha

illusional · 2019-07-05T06:05:22Z

Thanks @TMiguelT, all tests pass so I'm going to merge this!

illusional added 12 commits April 29, 2019 15:05

Ensure 'None' is serialized if it's present in the dictionary

cd38bb8

Add method to parse classes (automatically) from CWL dict

3feb39f

Add more features to automatic parsing of CWL

85abf54

Refactor Workflow dependencies + add parse_with_id for list/dict parsing

eced0ac

Rename ignore_ attribute names for consistency

4d5c2d1

Modify CWLTool parser to use new parsing method

3553e62

Swap out parser in TestImport and fix broken tests

df73a7b

Add Workflow unit tests

a5126ac

Merge branch 'master' of github.com:common-workflow-language/python-c…

e34cbae

…wlgen into parsing

Python2 and Python3 support for inspect.getargspec + speed up tests

499695a

Modify dictionary order to ensure tests succeed

105774c

Disallow extra params (on class) when parsing and ensure ignore fields

2dce419

illusional mentioned this pull request Jul 5, 2019

V0.3.0 release #24

Merged

Ensure some fields on commandlinetool do not get converted by default

04368eb

multimeric approved these changes Jul 5, 2019

View reviewed changes

illusional added 7 commits July 5, 2019 15:41

Add better docstrings to import_cwl

2c0c55b

Convert parse_types hints to dictionary over k-v tuples + docstring

56bc1af

Propagate changes from parse_types previous commit (kv-tuple -> dict)

cafe1dc

Per previous commit

e955d49

Improve documentation for parse_cwl / parse_cwl_dict on docs

36cc0ff

Ensure Workflow Output param is labelled + document workflow 'in' method

6d19e63

Providing an primitive hint T should pass an array of T

71ff30a

(not just convert the array of strings to a string "['value']" haha

illusional merged commit d322635 into common-workflow-lab:master Jul 5, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

CWL Parsing #22

CWL Parsing #22

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

		@@ -15,13 +16,109 @@
		# Class(es) ------------------------------


		class CommandOutputBinding(Serializable):

Uh oh!

CWL Parsing #22

CWL Parsing #22

Uh oh!

Conversation

Uh oh!

Uh oh!

Codecov Report

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!