Skip to content

Low Sample Matching in MICOM Model #207

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
dorarinyo88 opened this issue Feb 19, 2025 · 1 comment
Open

Low Sample Matching in MICOM Model #207

dorarinyo88 opened this issue Feb 19, 2025 · 1 comment

Comments

@dorarinyo88
Copy link

I am trying to build a MICOM model using abundance data generated from Paprica, with preprocessing applied to match MICOM’s expected format. However, I am encountering the following warning when running MICOM:
Less than 50% of the abundance could be matched to the model database. Model community.py:229 sample_1105 may not be representative of the sample.

Additionally, when checking the manifest file, I noticed that none of my taxa were found in the MICOM model database, leading to a found_abundance_fraction of 0.0 for all samples.

Dataset Information

  • The taxonomy and abundance values were generated using Paprica.
  • Preprocessing was done to format it for MICOM.
  • Here is an example of my dataset before processing:

Image

The manifest file shows that no taxa were matched in MICOM:

Image

Would appreciate any guidance on resolving this issue. Thanks!

@cdiener
Copy link
Collaborator

cdiener commented Feb 28, 2025

Hi,

So your taxa don't look right. The genus seems to be wrong for all of them. funiformis is Megamonas and Firmicutes is a phylum not a genus. copri would the Prevotella genus. MICOM always matches on as many ranks as possible, so in that case none of the genera match.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants