Workflow Type: Galaxy
Open
Secondary metabolite biosynthetic gene cluster (SMBGC) Annotation using Neural Networks Trained on Interpro Signatures
Associated Tutorial
This workflows is part of the tutorial Marine Omics identifying biosynthetic gene clusters, available in the GTN
Features
- Includes Galaxy Workflow Tests
- Includes a Galaxy Workflow Report
- Uses Galaxy Workflow Comments
Thanks to...
Workflow Author(s): Marie Jossé
Tutorial Author(s): Marie Josse
Funder(s): Fair-Ease, EuroScienceGateway
Inputs
ID | Name | Description | Type |
---|---|---|---|
Fasta nucelotide file | Fasta nucelotide file | BGC0001472.fna |
|
Steps
ID | Name | Description |
---|---|---|
1 | Prodigal Gene Predictor | Create the protein fasta file toolshed.g2.bx.psu.edu/repos/iuc/prodigal/prodigal/2.6.3+galaxy0 |
2 | Sanntis: Build Genbank | Use of Sanntis toolshed.g2.bx.psu.edu/repos/ecology/sanntis_marine/sanntis_marine/0.9.3.5+galaxy1 |
3 | Regex Find And Replace | Remove useless * in the protein fasta file toolshed.g2.bx.psu.edu/repos/galaxyp/regex_find_replace/regex1/1.0.3 |
4 | InterProScan | Create TSV file for Sanntis toolshed.g2.bx.psu.edu/repos/bgruening/interproscan/interproscan/5.59-91.0+galaxy3 |
5 | Sanntis: identify biosynthetic gene clusters | toolshed.g2.bx.psu.edu/repos/ecology/sanntis_marine/sanntis_marine/0.9.3.5+galaxy1 |
Outputs
ID | Name | Description | Type |
---|---|---|---|
Protein fasta file | Protein fasta file | n/a |
|
Genbank file | Genbank file | n/a |
|
Clean protein fasta file | Clean protein fasta file | n/a |
|
Tabular file (.tsv) | Tabular file (.tsv) | n/a |
|
SMBGC Annotation | SMBGC Annotation | n/a |
|
Version History
1.0 (earliest) Created 26th Aug 2024 at 14:06 by Helena Rasche
Added/updated 4 files
Open
master
3fecd1a
Creators and Submitter
Creators
Not specifiedSubmitter
Tools
Activity
Views: 240 Downloads: 60
Created: 26th Aug 2024 at 14:06
Last updated: 26th Aug 2024 at 14:06
Attributions
None