Standardizing inputs to AIRR format

With multiple potential sources of TCR count tables, the pipeline should be able to handle inputs beyond Adaptive. Particularly relevant is taking outputs from CellRanger for scRNA TCR sequencing, to enable pseudobulking of single-cell data (see #33).

Bare minimum columns:

- **full nucleotide sequence**
- full nucleotide aa sequence (can infer from full nuc seq)
- CDR3 nucleotide (can infer from full nuc seq)
- CDR3 aa sequence (can infer from full nuc seq)
- **count**
- **VJ identifications**

We will need to agree on a standard to conform Adaptive and any other imported count tables to; [AIRR](https://pmc.ncbi.nlm.nih.gov/articles/PMC6173121/) is a possible standard widely-adopted by the open source community. Other pipelines have been certified as [AIRR-compliant](https://docs.airr-community.org/en/stable/swtools/airr_swtools_compliant.html), and I will investigate how we can conform our inputs to this standard.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Standardizing inputs to AIRR format #32

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Standardizing inputs to AIRR format #32

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions