Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update from feeds: https://galaxyproject.org/news/2024-11-15-mgnify-v5/ #146

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

galaxy-social-bot[bot]
Copy link
Contributor

This PR is created automatically by a feeds bot.
Update since 2024-11-04

Processed:
MGnify amplicon pipeline v5.0 on Galaxy

Copy link
Contributor

👋 Hello! I'm your friendly social media assistant. Below are the previews of this post:
posts/feed_bot/galaxy_europe/2024-11-15-mgnify-v5-group1.md

matrix-eu-announce


MGnify_v5.0

📝 New blog post Released! https://galaxyproject.org/news/2024-11-15-mgnify-v5/

MGnify amplicon pipeline v5.0

The microbial world is vast and complex, and the MGnify amplicon pipeline has long been a cornerstone for processing and analyzing 16S, 18S, and ITS rRNA gene amplicon sequences. Its version 5.0 brings advanced features and enhanced accuracy to microbiome research. But how can this pipeline be made even more accessible to researchers worldwide? The answer: porting it to Galaxy.

This diagram illustrates the MGnify amplicon pipeline v5.0 workflow for microbial amplicon analysis. It starts by processing paired-end or single-end reads, merging and trimming them with SeqPrep and Trimmomatic, followed by quality control checks and filtering.

The pipeline then splits into two paths: rRNA prediction and ITS sequence analysis. For rRNA-prediction, sequences are identified using the Rfam database and classified into SSU and LSU regions. ITS sequences are separated, masked with bedtools, and classified with databases like UNITE and SILVA.

Finally, all taxonomic results are visualized with Krona, providing an interactive view of microbial composition. This setup enables efficient, thorough microbial profiling of amplicon data.

Availability on Galaxy

Version 5.0 of the MGnify amplicon pipeline is now fully available on the Galaxy EU server, bringing advanced microbial analysis workflows to researchers through an accessible, web-based interface.

The pipeline has been ported into five modular sub-workflows, allowing users flexibility in their analysis:

  • Quality control for single-end reads
  • Quality control for paired-end reads
  • rRNA-prediction
  • ITS analysis
  • Summary tables

These sub-workflows can be run individually or combined to execute the entire pipeline, giving researchers the freedom to tailor the analysis to their specific needs. This modular design ensures accessibility and adaptability for diverse microbiome research projects. The pipeline is designed to take NCBI reads as input, but users can easily modify it to work with their own data uploaded directly to the Galaxy history. In addition to the main sub-workflows, two additional workflows have been created to simplify downstream analysis:

  • A workflow that generates summary tables at a user-specified taxonomic rank
  • A workflow that formats MAPseq results to produce ampvis2-compatible input

These workflows make it easier for users to customize and visualize their analysis results, enhancing the overall utility of the pipeline.

Testing the pipeline

During the process, the ported pipeline was tested against its counterpart, yielding promising results.

The figure above compares the relative abundance of taxa between the MGnify and Galaxy pipelines. It shows the results from 10 randomly selected human gut samples. Taxa that were present in Galaxy but absent in MGnify were removed to emphasize the discrepancies. Overall, the results demonstrate highly similar patterns between the two pipelines, with minor discrepancies attributed to the non-deterministic nature of the MAPseq approach.

What's Next?

The successful port of MGnify pipeline v5.0 is just the beginning. Progress is being made in porting the new version, v6.0, to Galaxy. This version is more sophisticated, includes additional sub-workflows and tools, utilizes a broader range of databases, and features an updated version of MAPseq with a seed option.

linkedin-galaxyproject

📝 New blog post Released!
https://galaxyproject.org/news/2024-11-15-mgnify-v5/

MGnify amplicon pipeline v5.0

The microbial world is vast and complex, and the MGnify amplicon pipeline has long been a cornerstone for processing and analyzing 16S, 18S, and ITS rRNA gene amplicon sequences. Its version 5.0 brings advanced features and enhanced accuracy to microbiome research. But how can this pipeline be made even more accessible to researchers worldwide? The answer: porting it to Galaxy.

: https://docs.mgnify.org/src/docs/analysis.html

This diagram illustrates the MGnify amplicon pipeline v5.0 workflow for microbial amplicon analysis. It starts by processing paired-end or single-end reads, merging and trimming them with SeqPrep and Trimmomatic, followed by quality control checks and filtering.

The pipeline then splits into two paths: rRNA prediction and ITS sequence analysis. For rRNA-prediction, sequences are identified using the Rfam database and classified into SSU and LSU regions. ITS sequences are separated, masked with bedtools, and classified with databases like UNITE and SILVA.

Finally, all taxonomic results are visualized with Krona, providing an interactive view of microbial composition. This setup enables efficient, thorough microbial profiling of amplicon data.

Availability on Galaxy

Version 5.0 of the MGnify amplicon pipeline is now fully available on the Galaxy EU server, bringing advanced microbial analysis workflows to researchers through an accessible, web-based interface.

The pipeline has been ported into five modular sub-workflows, allowing users flexibility in their analysis:

  • Quality control for single-end reads
  • Quality control for paired-end reads
  • rRNA-prediction
  • ITS analysis
  • Summary tables

These sub-workflows can be run individually or combined to execute the entire pipeline, giving researchers the freedom to tailor the analysis to their specific needs. This modular design ensures accessibility and adaptability for diverse microbiome research projects. The pipeline is designed to take NCBI reads as input, but users can easily modify it to work with their own data uploaded directly to the Galaxy history. In addition to the main sub-workflows, two additional workflows have been created to simplify downstream analysis:

  • A workflow that generates summary tables at a user-specified taxonomic rank
  • A workflow that formats MAPseq results to produce ampvis2-compatible input

These workflows make it easier for users to customize and visualize their analysis results, enhancing the overall utility of the pipeline.

Testing the pipeline

During the process, the ported pipeline was tested against its counterpart, yielding promising results. (1/2)

The figure above compares the relative abundance of taxa between the MGnify and Galaxy pipelines. It shows the results from 10 randomly selected human gut samples. Taxa that were present in Galaxy but absent in MGnify were removed to emphasize the discrepancies. Overall, the results demonstrate highly similar patterns between the two pipelines, with minor discrepancies attributed to the non-deterministic nature of the MAPseq approach.

What's Next?

The successful port of MGnify pipeline v5.0 is just the beginning. Progress is being made in porting the new version, v6.0, to Galaxy. This version is more sophisticated, includes additional sub-workflows and tools, utilizes a broader range of databases, and features an updated version of MAPseq with a seed option.

#UseGalaxy #GalaxyProject #UniFreiburg #EOSC #EuroScienceGateway (2/2)


MGnify_v5.0

mastodon-eu-freiburg

📝 New blog post Released!
https://galaxyproject.org/news/2024-11-15-mgnify-v5/

MGnify amplicon pipeline v5.0

The microbial world is vast and complex, and the MGnify amplicon pipeline has long been a cornerstone for processing and analyzing 16S, 18S, and ITS rRNA gene amplicon sequences. Its version 5.0 brings advanced features and enhanced accuracy to microbiome research. But how can this pipeline be made even more accessible to researchers worldwide? The answer: porting it to Galaxy.

: https://docs.mgnify.org/src/docs/analysis.html (1/8)

This diagram illustrates the MGnify amplicon pipeline v5.0 workflow for microbial amplicon analysis. It starts by processing paired-end or single-end reads, merging and trimming them with SeqPrep and Trimmomatic, followed by quality control checks and filtering.

The pipeline then splits into two paths: rRNA prediction and ITS sequence analysis. For rRNA-prediction, sequences are identified using the Rfam database and classified into SSU and LSU regions. ITS sequences are separated, (2/8)

masked with bedtools, and classified with databases like UNITE and SILVA.

Finally, all taxonomic results are visualized with Krona, providing an interactive view of microbial composition. This setup enables efficient, thorough microbial profiling of amplicon data.

Availability on Galaxy

Version 5.0 of the MGnify amplicon pipeline is now fully available on the Galaxy EU server, bringing advanced microbial analysis workflows to researchers through an accessible, web-based interface. (3/8)

The pipeline has been ported into five modular sub-workflows, allowing users flexibility in their analysis:

  • Quality control for single-end reads
  • Quality control for paired-end reads
  • rRNA-prediction
  • ITS analysis
  • Summary tables

These sub-workflows can be run individually or combined to execute the entire pipeline, giving researchers the freedom to tailor the analysis to their specific needs. This modular design ensures accessibility and adaptability for diverse microbiome (4/8)

research projects. The pipeline is designed to take NCBI reads as input, but users can easily modify it to work with their own data uploaded directly to the Galaxy history. In addition to the main sub-workflows, two additional workflows have been created to simplify downstream analysis:

  • A workflow that generates summary tables at a user-specified taxonomic rank
  • A workflow that formats MAPseq results to produce ampvis2-compatible input

These workflows make it easier for users to (5/8)

customize and visualize their analysis results, enhancing the overall utility of the pipeline.

Testing the pipeline

During the process, the ported pipeline was tested against its counterpart, yielding promising results. (6/8)

The figure above compares the relative abundance of taxa between the MGnify and Galaxy pipelines. It shows the results from 10 randomly selected human gut samples. Taxa that were present in Galaxy but absent in MGnify were removed to emphasize the discrepancies. Overall, the results demonstrate highly similar patterns between the two pipelines, with minor discrepancies attributed to the non-deterministic nature of the MAPseq approach.

What's Next?

The successful port of MGnify pipeline (7/8)

v5.0 is just the beginning. Progress is being made in porting the new version, v6.0, to Galaxy. This version is more sophisticated, includes additional sub-workflows and tools, utilizes a broader range of databases, and features an updated version of MAPseq with a seed option.

@[email protected]
#UseGalaxy #GalaxyProject #UniFreiburg #EOSC #EuroScienceGateway (8/8)


MGnify_v5.0

Copy link
Contributor

👋 Hello! I'm your friendly social media assistant. Below are the previews of this post:
posts/feed_bot/galaxy_europe/2024-11-15-mgnify-v5-group2.md

bluesky-galaxyproject

📝 New blog post Released!
https://galaxyproject.org/news/2024-11-15-mgnify-v5/

MGnify amplicon pipeline v5.0

The microbial world is vast and complex, and the MGnify amplicon pipeline has long been a cornerstone for processing and analyzing 16S, 18S, and ITS rRNA gene amplicon sequences. (1/13)

Its version 5.0 brings advanced features and enhanced accuracy to microbiome research. But how can this pipeline be made even more accessible to researchers worldwide? The answer: porting it to Galaxy.

: https://docs.mgnify.org/src/docs/analysis.html

This diagram illustrates the MGnify (2/13)

amplicon pipeline v5.0 workflow for microbial amplicon analysis. It starts by processing paired-end or single-end reads, merging and trimming them with SeqPrep and Trimmomatic, followed by quality control checks and filtering.

The pipeline then splits into two paths: rRNA prediction and ITS (3/13)

sequence analysis. For rRNA-prediction, sequences are identified using the Rfam database and classified into SSU and LSU regions. ITS sequences are separated, masked with bedtools, and classified with databases like UNITE and SILVA.

Finally, all taxonomic results are visualized with Krona, (4/13)

providing an interactive view of microbial composition. This setup enables efficient, thorough microbial profiling of amplicon data.

Availability on Galaxy

Version 5.0 of the MGnify amplicon pipeline is now fully available on the Galaxy EU server, bringing advanced microbial analysis (5/13)

workflows to researchers through an accessible, web-based interface.

The pipeline has been ported into five modular sub-workflows, allowing users flexibility in their analysis:

  • Quality control for single-end reads
  • Quality control for paired-end reads
  • rRNA-prediction
  • ITS analysis
  • (6/13)

Summary tables

These sub-workflows can be run individually or combined to execute the entire pipeline, giving researchers the freedom to tailor the analysis to their specific needs. This modular design ensures accessibility and adaptability for diverse microbiome research projects. The (7/13)

pipeline is designed to take NCBI reads as input, but users can easily modify it to work with their own data uploaded directly to the Galaxy history. In addition to the main sub-workflows, two additional workflows have been created to simplify downstream analysis:

  • A workflow that (8/13)

generates summary tables at a user-specified taxonomic rank

  • A workflow that formats MAPseq results to produce ampvis2-compatible input

These workflows make it easier for users to customize and visualize their analysis results, enhancing the overall utility of the pipeline.

Testing the (9/13)

pipeline

During the process, the ported pipeline was tested against its counterpart, yielding promising results. (10/13)

The figure above compares the relative abundance of taxa between the MGnify and Galaxy pipelines. It shows the results from 10 randomly selected human gut samples. Taxa that were present in Galaxy but absent in MGnify were removed to emphasize the discrepancies. Overall, the results (11/13)

demonstrate highly similar patterns between the two pipelines, with minor discrepancies attributed to the non-deterministic nature of the MAPseq approach.

What's Next?

The successful port of MGnify pipeline v5.0 is just the beginning. Progress is being made in porting the new version, (12/13)

v6.0, to Galaxy. This version is more sophisticated, includes additional sub-workflows and tools, utilizes a broader range of databases, and features an updated version of MAPseq with a seed option.

@galaxyproject.bsky.social
#UseGalaxy #GalaxyProject #UniFreiburg #EOSC #EuroScienceGateway (13/13)


MGnify_v5.0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

0 participants