singlecellmultiomics.universalBamTagger package¶
Submodules¶
singlecellmultiomics.universalBamTagger.4SUtagger module¶
singlecellmultiomics.universalBamTagger.bamtagmultiome module¶
-
singlecellmultiomics.universalBamTagger.bamtagmultiome.run_multiome_tagging(args)[source]¶ Run multiome tagging adds molecule information
Parameters: - bamin (str) – bam file to process
- o (str) – path to output bam file
- method (str) – Protocol to tag, select from:nla, qflag, chic, nla_transcriptome, vasa, cs, nla_taps ,chic_taps, nla_no_overhang
- qflagger (str) – Query flagging algorithm to use, this algorithm extracts UMI and sample information from your reads. When no query flagging algorithm is specified, the singlecellmultiomics.universalBamTagger.universalBamTagger.QueryNameFlagger is used
- method – Method name, what kind of molecules need to be extracted. Select from: nla qflag chic nla_transcriptome vasa cs nla_taps chic_taps
- custom_flags (str) – Arguments passed to the query name flagger, comma separated “MI,RX,BI,SM”
- ref (str) – Path to reference fasta file, autodected from bam header when not supplied
- umi_hamming_distance (int) – Max hamming distance on UMI’s
- head (int) – Amount of molecules to process
- contig (str) – only process this contig
- region_start (int) – Zero based start coordinate of single region to process
- region_end (int) – Zero based end coordinate of single region to process, None: all contigs when contig is not set, complete contig when contig is set.
- alleles (str) – path to allele VCF
- allele_samples (str) – Comma separated samples to extract from the VCF file. For example B6,SPRET
- unphased_alleles (str) – Path to VCF containing unphased germline SNPs
- mapfile (str) – ‘Path to *.safe.bgzf file, used to decide if molecules are uniquely mappable, generate one using createMapabilityIndex.py
- annotmethod (int) – Annotation resolving method. 0: molecule consensus aligned blocks. 1: per read per aligned base
- cluster (bool) – Run contigs in separate cluster jobs
- no_rejects (bool) – Do not write rejected reads
- mem (int) – Amount of gigabytes to request for cluster jobs
- time (int) – amount of wall clock hours to request for cluster jobs
- exons (str) – Path to exon annotation GTF file
- introns (str) – Path to intron annotation GTF file
- consensus (bool) – Calculate molecule consensus read, this feature is _VERY_ experimental
- consensus_model (str) – Path to consensus calling model, when none specified, this is learned based on the supplied bam file, ignoring sites supplied by -consensus_mask_variants
- consensus_mask_variants (str) – Path VCF file masked for training on consensus caller
- consensus_n_train (int) – Amount of bases used for training the consensus model
- no_source_reads (bool) – Do not write original reads, only consensus
singlecellmultiomics.universalBamTagger.customreads module¶
-
class
singlecellmultiomics.universalBamTagger.customreads.CustomAssingmentQueryNameFlagger(block_assignments, **kwargs)[source]¶ Bases:
singlecellmultiomics.universalBamTagger.digest.DigestFlaggerThis query name flagger converts values between colons “:” to tags
-
class
singlecellmultiomics.universalBamTagger.customreads.VaninsbergheQueryNameFlagger(**kwargs)[source]¶ Bases:
singlecellmultiomics.universalBamTagger.digest.DigestFlagger
singlecellmultiomics.universalBamTagger.digest module¶
singlecellmultiomics.universalBamTagger.mspjI module¶
singlecellmultiomics.universalBamTagger.nlaIII module¶
singlecellmultiomics.universalBamTagger.rna module¶
singlecellmultiomics.universalBamTagger.scar module¶
singlecellmultiomics.universalBamTagger.scchic module¶
singlecellmultiomics.universalBamTagger.tag module¶
singlecellmultiomics.universalBamTagger.taps module¶
-
class
singlecellmultiomics.universalBamTagger.taps.TAPSFlagger(reference, **kwargs)[source]¶ Bases:
singlecellmultiomics.universalBamTagger.digest.DigestFlagger
singlecellmultiomics.universalBamTagger.tapsTabulator module¶
singlecellmultiomics.universalBamTagger.tapsTagger module¶
singlecellmultiomics.universalBamTagger.universalBamTagger module¶
-
class
singlecellmultiomics.universalBamTagger.universalBamTagger.AlleleTagger(**kwargs)[source]¶ Bases:
singlecellmultiomics.universalBamTagger.digest.DigestFlagger
-
class
singlecellmultiomics.universalBamTagger.universalBamTagger.MoleculeIterator_OLD(alignmentfile, look_around_radius=100000, umi_hamming_distance=0, sample_select=None, **pysam_kwargs)[source]¶ Bases:
objectIterate over molecules in a bam file
Parameters: - alignmentfile (pysam.AlignmentFile) – file to read the molecules from
- look_around_radius (int) – buffer to accumulate molecules in. All fragments belonging to one molecule should fit this radius
- umi_hamming_distance (int) – Edit distance on UMI, 0: only exact match, 1: single base distance
- sample_select (iterable) – Iterable of samples to only select molecules from
Yields: - list of molecules (list [ pysam.AlignedSegment ])
- [ (R1,R2), (R1,R2) … ]
-
class
singlecellmultiomics.universalBamTagger.universalBamTagger.QueryNameFlagger(**kwargs)[source]¶ Bases:
singlecellmultiomics.universalBamTagger.digest.DigestFlagger
-
class
singlecellmultiomics.universalBamTagger.universalBamTagger.TranscriptIterator(look_around_radius=100, informative_read=2, assignment_radius=10, **kwargs)[source]¶ Bases:
singlecellmultiomics.universalBamTagger.universalBamTagger.MoleculeIterator_OLD