hisplan/10k-ft-overview-seqc

10k-ft-overview-seqc

10,000-ft Overview of SEQC

Code Overview

Main Function

run.py

def run(args) -> None:
  ...

Program Arguments

args

Platform

Platform-specific code

  • 10x v2
  • 10x v3
  • InDrop
  • ...
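As a rough illustration of what platform-specific code has to encode, the sketch below models only the barcode read layout. The class name, the registry keys, and the method are hypothetical and not taken from SEQC. The lengths are the standard 10x ones (16 bp cell barcode, with a 10 bp UMI for v2 and a 12 bp UMI for v3). InDrop is left out to keep the sketch to fixed-length layouts.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class PlatformSpec:
    """Hypothetical description of where barcodes sit in the barcode read."""
    name: str
    cell_barcode_len: int
    umi_len: int

    def split_barcode_read(self, seq: str) -> Tuple[str, str]:
        """Return (cell_barcode, umi) sliced from the start of the read."""
        end = self.cell_barcode_len + self.umi_len
        if len(seq) < end:
            raise ValueError(f"read too short for {self.name}: {len(seq)} < {end}")
        return seq[:self.cell_barcode_len], seq[self.cell_barcode_len:end]


# Registry keys are illustrative only.
PLATFORMS = {
    "ten_x_v2": PlatformSpec("10x v2", cell_barcode_len=16, umi_len=10),
    "ten_x_v3": PlatformSpec("10x v3", cell_barcode_len=16, umi_len=12),
}
```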

Determine Starting Point

  • From FASTQ (.fastq)
  • From aligned BAM (.bam)
  • From ReadArray object (.h5)
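The three entry points above can be told apart by file extension. The following is a minimal sketch of that dispatch. `StartPoint` and `determine_start_point` are made-up names for illustration, and the real pipeline may decide differently (for example from its arguments rather than from the file name).

```python
from enum import Enum
from pathlib import Path


class StartPoint(Enum):
    FASTQ = "fastq"            # run merge, alignment, and everything after
    BAM = "bam"                # skip merge and alignment
    READ_ARRAY = "read_array"  # skip straight to the correction steps


_BY_SUFFIX = {
    ".fastq": StartPoint.FASTQ,
    ".bam": StartPoint.BAM,
    ".h5": StartPoint.READ_ARRAY,
}


def determine_start_point(path: str) -> StartPoint:
    """Map an input file name to the pipeline stage it can start from."""
    suffix = Path(path).suffix.lower()
    try:
        return _BY_SUFFIX[suffix]
    except KeyError:
        raise ValueError(f"unsupported input: {path}") from None
```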

Download

  • Inputs

Merge FASTQ

  • READ1 + READ2
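One way to picture the merge: the cell barcode and UMI are lifted out of the barcode read and carried along in the name of the genomic read, so only one FASTQ needs to be aligned. The sketch below assumes READ1 holds the barcode and UMI and READ2 the cDNA (as in the 10x layouts). The header format and the function name are hypothetical, not SEQC's.

```python
from typing import Iterable, Iterator, Tuple

Record = Tuple[str, str, str]  # (name, sequence, quality)


def merge_records(
    barcode_reads: Iterable[Record],
    genomic_reads: Iterable[Record],
    cell_barcode_len: int = 16,
    umi_len: int = 10,
) -> Iterator[Record]:
    """Annotate each genomic read with the barcode and UMI of its mate.

    Both inputs must be in the same order; zip() stops at the shorter one.
    """
    for (_, b_seq, _), (g_name, g_seq, g_qual) in zip(barcode_reads, genomic_reads):
        cell = b_seq[:cell_barcode_len]
        umi = b_seq[cell_barcode_len:cell_barcode_len + umi_len]
        # Illustrative header layout: "<cell>:<umi>;<original name>"
        yield (f"{cell}:{umi};{g_name}", g_seq, g_qual)
```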

Alignment

  • Internally using STAR
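For orientation, a STAR invocation can be assembled as below. This is only a sketch: the options are real STAR options, but which ones SEQC actually passes is not stated here, and `build_star_command` is a hypothetical helper.

```python
from typing import List


def build_star_command(
    genome_dir: str, fastq: str, out_prefix: str, threads: int = 4
) -> List[str]:
    """Build (but do not run) a single-end STAR alignment command."""
    return [
        "STAR",
        "--runThreadN", str(threads),
        "--genomeDir", genome_dir,         # directory holding the STAR index
        "--readFilesIn", fastq,            # the merged FASTQ
        "--outFileNamePrefix", out_prefix,
        "--outSAMtype", "BAM", "Unsorted",
    ]
```

The resulting list can be handed to `subprocess.run(cmd, check=True)` on a machine where STAR is installed.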

Creating ReadArray Object

  • H5 object
  • Explore the ReadArray object
    • pysam and pyranges are not required for exploring the ReadArray object, but they are required for BAM/GTF lookups.

Cell Barcode Correction
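A common, generic form of cell barcode correction is to accept a barcode that is on a whitelist and to rescue one that is a single substitution away from exactly one whitelisted barcode. The sketch below shows that idea only. It is not necessarily the rule SEQC applies, and the function name is illustrative.

```python
from typing import Optional, Set


def correct_barcode(barcode: str, whitelist: Set[str]) -> Optional[str]:
    """Return the whitelisted barcode for `barcode`, or None if unresolved."""
    if barcode in whitelist:
        return barcode
    candidates = set()
    for i, original in enumerate(barcode):
        for base in "ACGT":
            if base == original:
                continue
            candidate = barcode[:i] + base + barcode[i + 1:]
            if candidate in whitelist:
                candidates.add(candidate)
    # Correct only when the single-substitution neighbor is unambiguous.
    return candidates.pop() if len(candidates) == 1 else None
```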

Resolve Ambiguous Alignments

  • Multimapping
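To make the multimapping problem concrete: a read may align to several genes, but reads that share a cell barcode and UMI come from the same molecule and so should agree on one gene. The sketch below is a simplified heuristic built on that observation, intersecting the candidate genes per molecule. It is an assumption for illustration and does not reproduce SEQC's actual resolution method.

```python
from typing import Dict, FrozenSet, Iterable, Optional, Tuple

Read = Tuple[str, str, FrozenSet[str]]  # (cell, umi, candidate genes)


def resolve_by_molecule(
    reads: Iterable[Read],
) -> Dict[Tuple[str, str], Optional[str]]:
    """Assign one gene per (cell, umi), or None if it stays ambiguous."""
    shared: Dict[Tuple[str, str], FrozenSet[str]] = {}
    for cell, umi, genes in reads:
        key = (cell, umi)
        # Keep only genes that every read of this molecule is compatible with.
        shared[key] = genes if key not in shared else shared[key] & genes
    return {
        key: next(iter(genes)) if len(genes) == 1 else None
        for key, genes in shared.items()
    }
```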

UMI Correction

  • Dask
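A minimal sketch of UMI correction for a single (cell, gene) group follows, assuming the generic rule that a UMI one substitution away from a more abundant UMI is a sequencing error of it. The rule, the function names, and the tie-breaking are assumptions, not SEQC's documented behavior.

```python
from collections import Counter
from typing import Dict


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def correct_umis(counts: Dict[str, int]) -> Dict[str, int]:
    """Fold each UMI into a more abundant UMI at Hamming distance 1, if any."""
    corrected: Counter = Counter()
    # Visit UMIs from most to least abundant (ties broken alphabetically).
    for umi, n in sorted(counts.items(), key=lambda kv: (-kv[1], kv[0])):
        parent = next((kept for kept in corrected if hamming(kept, umi) == 1), None)
        corrected[umi if parent is None else parent] += n
    return dict(corrected)
```

Because each (cell, gene) group is independent of the others, a function like this is easy to map over groups in parallel, which is presumably the role Dask plays in this step.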

Sparse Count Matrix

  • mtx and csv
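For readers unfamiliar with the mtx (Matrix Market) format, the sketch below writes a small count matrix in its coordinate layout: a banner line, a size line (rows, columns, non-zero entries), then one 1-based `row col value` triple per non-zero count. The function name and the dictionary input are illustrative assumptions, and the accompanying csv files are not shown.

```python
from typing import Dict, Tuple


def to_mtx(counts: Dict[Tuple[int, int], int], n_rows: int, n_cols: int) -> str:
    """Serialize {(row, col): count} (0-based) as Matrix Market text."""
    lines = [
        "%%MatrixMarket matrix coordinate integer general",
        f"{n_rows} {n_cols} {len(counts)}",
    ]
    for (row, col), value in sorted(counts.items()):
        # Matrix Market indices are 1-based.
        lines.append(f"{row + 1} {col + 1} {value}")
    return "\n".join(lines) + "\n"
```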

Filtering

  • Four filters

Dense Count Matrix

  • csv
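The dense form simply fills in the zeros that the sparse form omits. A small sketch, assuming a cells-by-genes orientation and a header row of gene names (both are assumptions, as is the function name):

```python
import csv
import io
from typing import Dict, List, Tuple


def dense_csv(
    counts: Dict[Tuple[int, int], int], cells: List[str], genes: List[str]
) -> str:
    """Render sparse {(cell_idx, gene_idx): count} as dense CSV text."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["cell"] + genes)
    for i, cell in enumerate(cells):
        # Missing entries in the sparse mapping are zero counts.
        writer.writerow([cell] + [counts.get((i, j), 0) for j in range(len(genes))])
    return buf.getvalue()
```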

Development/Debug Environment

Setting Up Dev Environment

https://github.com/dpeerlab/seqc/blob/master/docs/install-dev.md

Testing & CI/CD

https://github.com/dpeerlab/seqc/blob/master/docs/run-test.md

  • nosetests
  • CircleCI
  • GitHub Actions

Packaging

  • Docker
  • Packer
  • WDL
