Description

Taxonomic classification of long DNA sequences and metagenome assembled genomes (e.g. MAGs / bins).

Input

name:type
description
pattern

meta

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

bins

:file

One or more nucleotide FASTA file containing binned long DNA sequences.

*.{fasta,fna,fa,fas}

meta2

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

database

:directory

Directory containing CAT_pack database files (e.g. output from CAT_pack prepare)

*/

meta3

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

taxonomy

:directory

Directory containing CAT_pack taxonomy files (e.g. output from CAT_pack prepare)

*/

meta4

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

proteins

:directory

Optional pre predicted-made proteins FASTA

*.{fasta,faa,fa,fas}

meta5

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

diamond_table

:directory

Optional pre-made DIAMOND alignment table

*

bin_suffix

:string

Suffix to search for in the input files when bins is a directory.

Output

name:type
description
pattern

orf2lca

meta

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*.ORF2LCA.txt

:file

A TSV file with per-ORF hit stats and identified lineage

*.ORF2LCA.txt

bin2classification

meta

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*.bin2classification.txt

:file

A TSV file with per-bin hit stats and assignment justification information

*.bin2classification.txt

log

meta

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*.log

:file

Log file with run messages and basic statistics

diamond

meta

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*.diamond

:file

Intermediate DIAMOND TSV summary output file with alignment results

faa

meta

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*.predicted_proteins.faa

:file

FAA file of DIAMOND predicted proteins hits

gff

meta

:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*.gff

:file

GFF file of DIAMOND predicted proteins hits

versions_catpack

${task.process}

:string

The process the versions were collected from

catpack

:string

The tool name

CAT_pack --version | sed 's/CAT_pack pack v//g;s/ .*//g'

:eval

The expression to obtain the version of the tool

Topics

name:type
description
pattern

versions

${task.process}

:string

The name of the process

catpack

:string

The name of the tool

CAT_pack --version | sed 's/CAT_pack pack v//g;s/ .*//g'

:eval

The expression to obtain the version of the tool

Tools

catpack
MIT

CAT/BAT: tool for taxonomic classification of contigs and metagenome-assembled genomes (MAGs)