K562_vivo_rep1_2.sorted.bam

command: /Share/home/zhangqf/lipan/usr/icSHAPE-pipe-1.0.0/bin/Functions/samStatistics.py -i 5.map_genome/K562_vivo_rep1_2.sorted.bam -o SamStatistic/K562_vivo_rep1_2 --fast -g /150T/zhangqf/GenomeAnnotation/Gencode/hg38.genomeCoor.bed

1. Basic Information

There are a total of 266,017,229 reads, and the sample rate is 0.004

2. Mapping of data

The following table and pie chart count the mapping rate of reads and the ratio of mapping to annotated genes and intergenic regions.

Type Sampled number Est. ratio
exon564,91756.48%
un-mapped248,56624.85%
intron143,53214.35%
intergenic42,5204.25%
un-annotated6360.06%

3. Mapping on Exon

The table below and the pie chart record the proportion of reads on each gene exon.

Type Sampled number Est. ratio
protein_coding397,46770.36%
retained_intron70,95012.56%
nonsense_mediated_decay38,5356.82%
processed_transcript37,7116.68%
lincRNA10,0391.78%
antisense3,3000.58%
processed_pseudogene3,0270.54%
misc_RNA6480.11%
Mt_rRNA6290.11%
TEC4430.08%
sense_intronic3940.07%
unprocessed_pseudogene3090.05%
snRNA2390.04%
transcribed_unprocessed_pseudogene2050.04%
non_stop_decay1770.03%
sense_overlapping1740.03%
Mt_tRNA1520.03%
snoRNA1260.02%
transcribed_processed_pseudogene1090.02%
ribozyme750.01%
3prime_overlapping_ncRNA630.01%
non_coding320.01%
transcribed_unitary_pseudogene190.00%
miRNA170.00%
unitary_pseudogene170.00%
macro_lncRNA160.00%
scaRNA140.00%
rRNA110.00%
IG_V_pseudogene30.00%
pseudogene20.00%
TR_V_pseudogene10.00%
IG_V_gene10.00%

4. Mapping on Intron

The table below and the pie chart record the proportion of reads on each gene intron.

Type Sampled number Est. ratio
protein_coding90,58563.11%
processed_transcript17,17411.97%
nonsense_mediated_decay12,5438.74%
lincRNA10,1057.04%
retained_intron7,3435.12%
antisense4,7433.30%
transcribed_unprocessed_pseudogene2970.21%
unprocessed_pseudogene2570.18%
sense_overlapping2550.18%
sense_intronic770.05%
non_stop_decay540.04%
transcribed_unitary_pseudogene360.03%
transcribed_processed_pseudogene240.02%
processed_pseudogene160.01%
unitary_pseudogene60.00%
3prime_overlapping_ncRNA50.00%
bidirectional_promoter_lncRNA40.00%
IG_V_pseudogene10.00%

5. Reads mutation profile

The histogram below shows the average mutation rate of each base on the map region of reads.

mutation profile

6. Reads Length

The figure below shows the distribution of the sampled read length.

read length