Bioinformatics Data Analysis

Results generated by running the recipe.

Parameters used during the run:

  • Genome Accession Number: NZ_CP008918
  • SRA Run Number: SRR4124989

Output Messages

Messages printed to the standard output stream:

Read 1047822 spots for SRR4124989
Written 1047822 spots for SRR4124989
*** Sequence statistics
file                      format  type   num_seqs      sum_len  min_len  avg_len  max_len
reads/SRR4124989_1.fastq  FASTQ   DNA   1,047,822  105,830,022      101      101      101
reads/SRR4124989_2.fastq  FASTQ   DNA   1,047,822  105,830,022      101      101      101
*** 
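
The statistics above can be cross-checked by hand: with every read exactly 101 bases long, sum_len should equal num_seqs times the read length. A minimal sketch of that check, with the values copied from the table (the variable names are illustrative only):

```shell
# Values copied from the seqkit stat table above (identical for both files).
num_seqs=1047822
read_len=101
reported_sum_len=105830022

# Fixed-length reads: total bases = number of reads x read length.
expected_sum_len=$((num_seqs * read_len))
echo "expected sum_len per file: $expected_sum_len"

if [ "$expected_sum_len" -eq "$reported_sum_len" ]; then
    echo "sum_len is consistent with num_seqs and read length"
else
    echo "sum_len does not match num_seqs x read length"
fi
```

Both files report the same num_seqs, which is what one would expect from a paired-end run split with --split-files.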

Other Messages

Messages printed to the standard error stream:

+ ACC=NZ_CP008918
+ SRR=SRR4124989
+ mkdir -p ref
+ REF=ref/NZ_CP008918.fa
+ efetch -db nuccore -id NZ_CP008918 -format fasta
+ mkdir -p reads
+ fastq-dump --origfmt --split-files -O reads SRR4124989
+ echo '*** Sequence statistics'
+ seqkit stat reads/SRR4124989_1.fastq reads/SRR4124989_2.fastq
+ echo '*** '
+ CPUS=4
+ R1=reads/SRR4124989_1.fastq
+ R2=reads/SRR4124989_2.fastq
+ samtools sort -l 0 --threads 4
+ bcftools mpileup -Ou -B --min-MQ 60 -f ref/NZ_CP008918.fa -
+ minimap2 -a -x sr -t 4 ref/NZ_CP008918.fa reads/SRR4124989_1.fastq reads/SRR4124989_2.fastq
+ bcftools norm -Ou -f ref/NZ_CP008918.fa -d all -
+ bcftools filter -Ov -e 'QUAL<40 || DP<10 || GT!="1/1"'
+ bcftools call -Ou -v -m -
Note: none of --samples-file, --ploidy or --ploidy-file given, assuming all sites are diploid
[M::mm_idx_gen::0.049*0.98] collected minimizers
[M::mm_idx_gen::0.059*1.43] sorted minimizers
[M::main::0.059*1.43] loaded/built the index for 1 target sequence(s)
[M::mm_mapopt_update::0.059*1.43] mid_occ = 1000
[M::mm_idx_stat] kmer size: 21; skip: 11; is_hpc: 0; #seq: 1
[M::mm_idx_stat::0.062*1.42] distinct minimizers: 372863 (99.45% are singletons); average occurrences: 1.017; average spacing: 5.993
[M::worker_pipeline::2.995*4.09] mapped 495050 sequences
[M::worker_pipeline::4.439*4.24] mapped 495050 sequences
[M::worker_pipeline::5.816*4.29] mapped 495050 sequences
[M::worker_pipeline::7.124*3.92] mapped 495050 sequences
[M::worker_pipeline::7.361*3.81] mapped 115444 sequences
[M::main] Version: 2.12-r827
[M::main] CMD: minimap2 -a -x sr -t 4 ref/NZ_CP008918.fa reads/SRR4124989_1.fastq reads/SRR4124989_2.fastq
[M::main] Real time: 7.365 sec; CPU: 28.036 sec
[bam_sort_core] merging from 0 files and 4 in-memory blocks...
[mpileup] 1 samples in 1 input files
Lines   total/split/realigned/skipped:	45273/0/336/0
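
As a sanity check on the minimap2 messages above, the batch sizes reported by worker_pipeline can be added up and compared with the number of input reads, which is two reads per spot for a paired-end run. This is a small sketch using numbers copied from the log; it assumes that "mapped N sequences" counts the reads processed in each batch and not only the reads that aligned.

```shell
# Spot count copied from the fastq-dump messages in this report.
spots=1047822

# Batch sizes copied from the worker_pipeline lines above.
total=0
for batch in 495050 495050 495050 495050 115444; do
    total=$((total + batch))
done

echo "sequences processed by minimap2: $total"
echo "expected (2 reads per spot):     $((2 * spots))"
```

If the two numbers agree, every read from both FASTQ files went through the aligner.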
+ bcftools stats variants.vcf
+ plot-vcfstats -P -p plots variants-stats.txt
Parsing bcftools stats output: variants-stats.txt
Plotting graphs: python plot.py
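
The lines starting with + in the trace above are printed as each command starts, and the stages of a pipe start concurrently, so their order in the log is not the order of the pipe. Below is a minimal sketch of what the variant calling step may look like, reconstructed from those trace lines. The commands and flags are copied from the trace; the stage order (align, sort, pileup, call, normalize, filter), the function name call_variants, the RUN_PIPELINE switch, and the redirections into variants.vcf and variants-stats.txt are assumptions inferred from the file names in the log, not the recipe's actual source.

```shell
set -eu

# Parameters as they appear in the trace.
ACC=NZ_CP008918
SRR=SRR4124989
CPUS=4
REF=ref/$ACC.fa
R1=reads/${SRR}_1.fastq
R2=reads/${SRR}_2.fastq

# Assumed stage order: align -> sort -> pileup -> call -> normalize -> filter.
call_variants() {
    minimap2 -a -x sr -t "$CPUS" "$REF" "$R1" "$R2" \
        | samtools sort -l 0 --threads "$CPUS" \
        | bcftools mpileup -Ou -B --min-MQ 60 -f "$REF" - \
        | bcftools call -Ou -v -m - \
        | bcftools norm -Ou -f "$REF" -d all - \
        | bcftools filter -Ov -e 'QUAL<40 || DP<10 || GT!="1/1"'
}

# Hypothetical switch: run only on request, so the file can be sourced safely.
if [ "${RUN_PIPELINE:-0}" = 1 ]; then
    call_variants > variants.vcf                       # assumed output name
    bcftools stats variants.vcf > variants-stats.txt   # assumed redirection
    plot-vcfstats -P -p plots variants-stats.txt
fi
```

Running it with RUN_PIPELINE=1 requires ref/NZ_CP008918.fa and both FASTQ files to exist already. Note that the filter keeps only homozygous alternate calls (GT "1/1") while bcftools call reported that it assumed diploid sites; for a bacterial genome, passing --ploidy 1 to bcftools call and adjusting the GT test accordingly may be worth considering.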
