merge the dev to master to 2.0 version release #86

jianhong · 2023-06-16T13:48:29Z

PR checklist

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

update the zenodo doi.

…to dev

…to dev2

edmundmiller

This is a great effort, and it's obvious you've put a lot of time into this!

It seems there are a couple of repetitive things that I'd like to see addressed (beyond what @mirpedrol and @SPPearce pointed out)

Inline scripts in Nextflow files. Those should be in bin/ or in a custom module. Just a nf-core thing, it makes it easier to maintain. I'm guessing you did that to be able to test the nextflow local modules, but since you've written so many custom scripts, I think it might be better to just test them with testthat, or another CLI tester. They're really extensive, so it would be nice to have confidence in them!
I'd like to see the hicexplorer, homer, etc. modules live in nf-core/modules
Make an image for those scary java tools 🙈
Homer modules need to use fasta input rather than the outside of Nextflow genome install. I'm not sure if it's working as intended as is, it would install the genome in the process' workdir.
Clean up subworkflows only outputting versions, and empty outputs. It'll make it more maintainable.

So I don't think this is entirely up to nf-core standards of maintainability, but I think in the name of progress we merge this and maybe tag it with 2.0.0-rc or something if 2.0.0 is controversial. Also, up to @jianhong and how they want to release it.

Before I would approve this PR, I'd like to see a roadmap tracked in the milestones of these issues to be addressed from this PR (and maybe with a list of modules to move scripts out of, for example). I think those will be easier to address in separate, smaller PRs.

We should also make sure proper repo access is given to @jianhong, so they can work on branches in this repo and create smaller PRs, so the review process doesn't take multiple months, and so we can review the scientific ideas in the code changes.

edmundmiller · 2023-07-12T20:53:57Z

CHANGELOG.md

+- update to nf-core-template-2.7.2
+- replace the dots by '\_' in the samples name
+- add support for MseI.
+- add TAD, AB compartments, APA analysis (see available tools in usage documentation)
+- add additional methods for interaction caller
+- adjust default resource requirement
+- update the CITAIONS
+- re-arrange the output folder to meet the requirements of multiple analysis tools
+- updated multipe documentations
+- remove the parameter for java resources requirements
+- update annotations from 5k,10k bins to nearest R2 peaks
+- removed local_modules tests
+- add scale factor to atac reads coverage calculation
+- change the circos plot style
+- update reads_summary table to include unmapped/multiple mapped info
+- add the possibility to subsample to balance the input reads
+- add parameter to let user input the 5 end sequence to cutadapt step
+- change the cutadapt error tolerance from 0 to 0.15
+- changes in hipeak caller:
+  - add filter condition for lowest count number as 1.
+  - change the cutoff value for hipeak type assignment step from 12 to 3.
+- fix multiple bugs:
+  - the sorting method for huge bed file;
+  - the post count for hipeak when there is empty Interactions;
+  - add totalLinks parameter for prepare_circos;
+  - the issue if bplapply does not work in differential analysis for hipeak;
+  - fix the space issue for enzyme_cut.nf;
+  - fix the chromosome style for homer TFEA analysis and annotation by ChIPpeakAnno;
+  - fix the duplicated imported modules;
+  - fix multiple typos.
+  - use local SAMTOOLS_SORT module;


I was going to ask if there were any links to these. But then I remembered this was all done on a fork. Maybe link some commits?

I think we need to add something to the nf-core docs for people to reach out if they're developing on a pipeline.

edmundmiller · 2023-07-12T20:56:00Z

assets/test_multi_samplesheet.csv

-WT,2,https://raw.githubusercontent.com/nf-core/test-datasets/hicar/data/genomics/homo_sapiens/T2/WT_rep2_R1.fastq.gz,https://raw.githubusercontent.com/nf-core/test-datasets/hicar/data/genomics/homo_sapiens/T2/WT_rep2_R2.fastq.gz,,
-KD,1,https://raw.githubusercontent.com/nf-core/test-datasets/hicar/data/genomics/homo_sapiens/T2/KD_rep1_R1.fastq.gz,https://raw.githubusercontent.com/nf-core/test-datasets/hicar/data/genomics/homo_sapiens/T2/KD_rep1_R2.fastq.gz,,
-KD,2,https://raw.githubusercontent.com/nf-core/test-datasets/hicar/data/genomics/homo_sapiens/T2/KD_rep2_R1.fastq.gz,https://raw.githubusercontent.com/nf-core/test-datasets/hicar/data/genomics/homo_sapiens/T2/KD_rep2_R2.fastq.gz,,
+WT.dot,1,https://raw.githubusercontent.com/nf-core/test-datasets/hicar/data/genomics/homo_sapiens/T1/WT_rep1_R1.fastq.gz,https://raw.githubusercontent.com/nf-core/test-datasets/hicar/data/genomics/homo_sapiens/T1/WT_rep1_R2.fastq.gz,,


I wonder if it would break stuff, or are the groups separated using . ?

edmundmiller · 2023-07-12T20:58:35Z

conf/test.config


    // Output folder
-    outdir = 'test_results'
+    outdir = 'results'


I believe these were removed because they could cause people using it on cloud to write the results to never write to a bucket. It is just the test config, just a consideration.

edmundmiller · 2023-07-12T20:59:36Z

conf/test_full.config

@@ -18,7 +20,7 @@ params {
    input = "${projectDir}/assets/test_full_samplesheet.csv"

    // Output folder
-    outdir = 'full_test_results'
+    outdir = 'results'


See the above comment, this one probably doesn't need to be set by default, it'll just result in a foot gun that's worse than users have to specify --outdir results

edmundmiller · 2023-07-12T21:02:05Z

modules/local/atacreads/atacqc.nf

Is there a reason this isn't a script in bin/?

edmundmiller · 2023-07-12T21:26:42Z

modules/local/juicer/addnorm.nf

+
+    input:
+    tuple val(meta), path(hic)
+    path hic_tools_jar


This should be in a container. I'm guessing it's https://github.com/aidenlab/HiCTools.

Nvm I looked at it and that looks like a nightmare, I guess I'll get to the config and this points at the jar download?

edmundmiller · 2023-07-12T21:28:11Z

modules/local/maps/maps.nf

+    val long_bedpe_postfix
+    val short_bed_postfix


Could be args in a config

edmundmiller · 2023-07-12T21:29:39Z

modules/local/re_cut.nf

+    cat <<-END_VERSIONS > versions.yml
+    "${task.process}":
+        python: \$(echo \$(python --version) | sed 's/Python //')
+    END_VERSIONS
+
+    restriction_enzyme_cutsite.py $enzyme


Suggested change

cat <<-END_VERSIONS > versions.yml

"${task.process}":

python: \$(echo \$(python --version) | sed 's/Python //')

END_VERSIONS

restriction_enzyme_cutsite.py $enzyme

restriction_enzyme_cutsite.py $enzyme

cat <<-END_VERSIONS > versions.yml

"${task.process}":

python: \$(echo \$(python --version) | sed 's/Python //')

END_VERSIONS

edmundmiller · 2023-07-12T21:31:36Z

modules/local/samtools/sort/main.nf

Could why this is a local module get documented? What'st the reason for not using a patch and the nf-core module? That will help keep this maintainable.

edmundmiller · 2023-07-12T21:35:36Z

nextflow.config


    // Boilerplate options
-    outdir                     = null
+    outdir                     = 'results'


Suggested change

outdir = 'results'

outdir = null

merge template 2.9

fix a issue in if id contains dots.

Dev remove output folder from config file

fix the meta id issue.

JoseEspinosa

Huge work @jianhong! 😄
Apart from my suggestions in the code, I agree with @emiller88 summary

JoseEspinosa · 2023-07-17T10:03:16Z

README.md

@@ -22,7 +19,7 @@ The pipeline can also handle the experiment of HiChIP, ChIA-PET, and PLAC-Seq. I

 The pipeline is built using [Nextflow](https://www.nextflow.io), a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It uses Docker/Singularity containers making installation trivial and results highly reproducible. The [Nextflow DSL2](https://www.nextflow.io/docs/latest/dsl2.html) implementation of this pipeline uses one container per process which makes it much easier to maintain and update software dependencies. Where possible, these processes have been submitted to and installed from [nf-core/modules](https://github.com/nf-core/modules) in order to make them available to all nf-core pipelines, and to everyone within the Nextflow community!

-On release, automated continuous integration tests run the pipeline on a full-sized dataset on the AWS cloud infrastructure. This ensures that the pipeline runs on AWS, has sensible resource allocation defaults set to run on real-world datasets, and permits the persistent storage of results to benchmark between pipeline releases and other analysis sources. The results obtained from the full-sized test can be viewed on the [nf-core website](https://nf-co.re/hicar/results).
+On release, automated continuous integration tests run the pipeline on a full-sized dataset on the AWS cloud infrastructure. This ensures that the pipeline runs on AWS, has sensible resource allocation defaults set to run on real-world datasets, and permits the persistent storage of results to benchmark between pipeline releases and other analysis sources.The results obtained from the full-sized test can be viewed on the [nf-core website](https://nf-co.re/hicar/results).


Suggested change

On release, automated continuous integration tests run the pipeline on a full-sized dataset on the AWS cloud infrastructure. This ensures that the pipeline runs on AWS, has sensible resource allocation defaults set to run on real-world datasets, and permits the persistent storage of results to benchmark between pipeline releases and other analysis sources.The results obtained from the full-sized test can be viewed on the [nf-core website](https://nf-co.re/hicar/results).

On release, automated continuous integration tests run the pipeline on a full-sized dataset on the AWS cloud infrastructure. This ensures that the pipeline runs on AWS, has sensible resource allocation defaults set to run on real-world datasets, and permits the persistent storage of results to benchmark between pipeline releases and other analysis sources. The results obtained from the full-sized test can be viewed on the [nf-core website](https://nf-co.re/hicar/results).

JoseEspinosa · 2023-07-17T10:32:29Z

conf/modules.config

 process {
    publishDir = [
        path: { "${params.outdir}/${task.process.tokenize(':')[-1].tokenize('_')[0].toLowerCase()}" },
        mode: params.publish_dir_mode,
-        saveAs: { filename -> filename.equals('versions.yml') ? null : filename },
-        enabled: false
+        saveAs: { filename -> filename.equals('versions.yml') ? null : filename }


Wouldn't it be better to enable it to false by default again and not need to declare some processes just to avoid the publication of its files (e.g. GUNZIP)?

JoseEspinosa · 2023-07-17T10:37:48Z

conf/test.config

-
-    // Output folder
-    outdir = 'test_results'
+    input = "${projectDir}/assets/test_multi_samplesheet.csv"


Should this not be available in test-datasets instead ?

I personally like to keep the test samplesheets in the repo as examples as well. That ensures they stay up to date! 😆

JoseEspinosa · 2023-07-17T10:38:18Z

conf/test_full.config

@@ -17,9 +17,6 @@ params {
    // Input data for full size test
    input = "${projectDir}/assets/test_full_samplesheet.csv"


Same as my comment above

JoseEspinosa · 2023-07-17T10:39:59Z

conf/test_hipeak.config

@@ -20,17 +20,17 @@ params {
    // Input data for full size test
    input = "${projectDir}/assets/test_multi_samplesheet.csv"


Same here, and any other input instance in the configs

JoseEspinosa · 2023-07-17T11:30:39Z

subworkflows/local/tads_caller/homer.nf

+        genome).tads
+    ch_versions = HOMER_FINDTADSANDLOOPS_TADS.out.versions
+
+


Suggested change

JoseEspinosa · 2023-07-17T11:31:53Z

subworkflows/local/tfea/homer.nf

+    HOMER_FINDMOTIFSGENOME(ch_new_bed, additional_param)
+    ch_versions = ch_versions.mix(HOMER_FINDMOTIFSGENOME.out.versions)
+
+


Suggested change

JoseEspinosa · 2023-07-17T11:32:10Z

subworkflows/local/v4c.nf

+            break
+    }
+
+


Suggested change

JoseEspinosa · 2023-07-17T11:32:39Z

subworkflows/local/v4c.nf

+
+
+    emit:
+    versions        = ch_versions          // channel: [ versions.yml ]


Also here only version is emitted, is this correct?

JoseEspinosa · 2023-07-17T11:34:36Z

subworkflows/local/v4c/cooltools.nf

+    versions        = ch_versions          // channel: [ versions.yml ]
+    v4c             = VIRTUAL4C_BY_COOLTOOLS.out.v4c


Suggested change

versions = ch_versions // channel: [ versions.yml ]

v4c = VIRTUAL4C_BY_COOLTOOLS.out.v4c

v4c = VIRTUAL4C_BY_COOLTOOLS.out.v4c

versions = ch_versions // channel: [ versions.yml ]

Also add channel content description

Fix multiple minor typos for review comments.

github-advanced-security · 2025-05-05T22:14:39Z

This pull request sets up GitHub code scanning for this repository. Once the scans have completed and the checks have passed, the analysis results for this pull request branch will appear on this overview. Once you merge this pull request, the 'Security' tab will show more code scanning analysis results (for example, for the default branch). Depending on your configuration and choice of analysis tool, future pull requests will be annotated with code scanning analysis results. For more information about GitHub code scanning, check out the documentation.

.github/workflows/fix-linting.yml

+    if: >
+      contains(github.event.comment.html_url, '/pull/') &&
+      contains(github.event.comment.body, '@nf-core-bot fix linting') &&
+      github.repository == 'nf-core/hicar'
+    runs-on: ubuntu-latest
+    steps:
+      # Use the @nf-core-bot token to check out so we can push later
+      - uses: actions/checkout@v3
+        with:
+          token: ${{ secrets.nf_core_bot_auth_token }}
+
+      # Action runs on the issue comment, so we don't get the PR by default
+      # Use the gh cli to check out the PR
+      - name: Checkout Pull Request
+        run: gh pr checkout ${{ github.event.issue.number }}
+        env:
+          GITHUB_TOKEN: ${{ secrets.nf_core_bot_auth_token }}
+
+      - uses: actions/setup-node@v3
+
+      - name: Install Prettier
+        run: npm install -g prettier @prettier/plugin-php
+
+      # Check that we actually need to fix something
+      - name: Run 'prettier --check'
+        id: prettier_status
+        run: |
+          if prettier --check ${GITHUB_WORKSPACE}; then
+            echo "result=pass" >> $GITHUB_OUTPUT
+          else
+            echo "result=fail" >> $GITHUB_OUTPUT
+          fi
+
+      - name: Run 'prettier --write'
+        if: steps.prettier_status.outputs.result == 'fail'
+        run: prettier --write ${GITHUB_WORKSPACE}

+      - name: Commit & push changes
+        if: steps.prettier_status.outputs.result == 'fail'
+        run: |
+          git config user.email "core@nf-co.re"
+          git config user.name "nf-core-bot"
+          git config push.default upstream
+          git add .
+          git status
+          git commit -m "[automated] Fix linting with Prettier"
+          git push


To fix the issue, we need to add a permissions block to the workflow. Since the workflow involves reading repository contents, checking out pull requests, and pushing changes, we will explicitly define the required permissions. The minimal permissions required are:

contents: write (to push changes to the repository).

pull-requests: write (to interact with pull requests).

The permissions block should be added at the root level of the workflow to apply to all jobs unless overridden.

.github/workflows/fix-linting.yml

+      - name: Install Prettier
+        run: npm install -g prettier @prettier/plugin-php
+
+      # Check that we actually need to fix something
+      - name: Run 'prettier --check'


.github/workflows/fix-linting.yml

+      - name: Run 'prettier --write'
+        if: steps.prettier_status.outputs.result == 'fail'
+        run: prettier --write ${GITHUB_WORKSPACE}
+
+      - name: Commit & push changes


jianhong and others added 30 commits May 3, 2022 14:51

update README.md for Zenodo DOI.

35b8872

Update README.md

beb7bdc

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

update the version number to dev.

1b5c67a

Update README.md

430c697

Co-authored-by: James A. Fellows Yates <jfy133@gmail.com>

update changelog.md

22c0c70

Update CHANGELOG.md

d2e6675

Merge pull request #69 from jianhong/master

373f7a4

update the zenodo doi.

Template update for nf-core/tools version 2.4

7a885d4

add support for MseI.

64bfcec

Merge remote-tracking branch 'upstream/dev' into dev

d2b193a

Merge remote-tracking branch 'upstream/nf-core-template-merge-2.4' in…

548783e

…to dev

fix the issue of multiqc_config file

ba146a3

fix a issue in README.

5694def

add TAD, AB compartments, APA analysis

85b2b49

change the pipeline to use different tools

2dc6af9

update workflow image.

1521b32

re-orgnize the juicer tools modules.

00c264b

add multiple scripts for homer

708255f

update scripts for hicdcplus

a1b9f1a

add filter condition for prepare_counts.

4b24e0f

fix a typo in homer_compartments

05d1ef3

Template update for nf-core/tools version 2.5

e831c97

Template update for nf-core/tools version 2.5.1

75abb89

Template update for nf-core/tools version 2.6

37c07b4

change the output folder for test_hipeak.conf

a64c3a3

fix a bug if juicer_tools_jar is false, there is no JUICER_PRE.out

d275f9a

Merge remote-tracking branch 'upstream/nf-core-template-merge-2.6' in…

2da1032

…to dev2

fix lint

618c10b

prettier write

83146a5

fix the space issue for enzyme_cut.nf

98cbc87

remove unused files; change file exists check by nf-validation.

c6d9fcd

edmundmiller self-requested a review July 12, 2023 20:48

edmundmiller reviewed Jul 12, 2023

View reviewed changes

jianhong added 17 commits July 12, 2023 19:22

try to fix an issue for annotation when empty file is input.

64e3423

remove the input_check.nf.

b5b9978

simplify the input meta data rearrangement code.

2ad59a0

Merge pull request #89 from jianhong/dev

41312de

merge template 2.9

remove output folder in the config files.

c7ed956

Merge branch 'nf-core:dev' into dev_remove_output_folder

7c41b30

update Changelog.md

1c22d9d

update Changelog.md

a526992

fix a issue in if id contains dots.

a3da5bb

Merge pull request #92 from jianhong/dev

24457c9

fix a issue in if id contains dots.

Merge branch 'nf-core:dev' into dev_remove_output_folder

510fc86

fix the meta issue.

7411a4c

improve the meta id related codes.

91febad

Merge pull request #90 from jianhong/dev_remove_output_folder

1ffcd8b

Dev remove output folder from config file

Merge branch 'nf-core:dev' into dev

d3fcb8c

Merge pull request #93 from jianhong/dev

1bef3c1

fix the meta id issue.

update the documentation for technique replicate.

7c75e59

JoseEspinosa requested changes Jul 17, 2023

View reviewed changes

jianhong added 7 commits July 17, 2023 08:45

fix a bug for pairsqcplot if no data is avalable.

ea77a88

Merge remote-tracking branch 'origin/dev2' into dev

16e7348

update documentations.

7ef9dbd

fix some minor stuff for the review comments.

6218288

Merge branch 'nf-core:dev' into dev

c416194

Run prettier -w.

59391ab

Merge pull request #96 from jianhong/dev

d2d17a9

Fix multiple minor typos for review comments.

github-advanced-security bot found potential problems May 5, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

merge the dev to master to 2.0 version release #86

merge the dev to master to 2.0 version release #86

Check warning

Copilot Autofix

Check failure

Check failure

		@@ -17,9 +17,6 @@ params {
		// Input data for full size test
		input = "${projectDir}/assets/test_full_samplesheet.csv"

		@@ -20,17 +20,17 @@ params {
		// Input data for full size test
		input = "${projectDir}/assets/test_multi_samplesheet.csv"

		genome).tads
		ch_versions = HOMER_FINDTADSANDLOOPS_TADS.out.versions

		HOMER_FINDMOTIFSGENOME(ch_new_bed, additional_param)
		ch_versions = ch_versions.mix(HOMER_FINDMOTIFSGENOME.out.versions)

		versions = ch_versions // channel: [ versions.yml ]
		v4c = VIRTUAL4C_BY_COOLTOOLS.out.v4c

@@ -5,2 +5,6 @@
+            permissions:
+              contents: write
+              pull-requests: write
             jobs:

merge the dev to master to 2.0 version release #86

Are you sure you want to change the base?

merge the dev to master to 2.0 version release #86

Conversation

PR checklist

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Check warning

Copilot Autofix

Check failure

Check failure