Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005002410 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 429202774 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTGCCTCTATGTGTAGATCTC | 579680 | 0.13505970490302563 | Illumina Single End PCR Primer 1 (96% over 31bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGCGTCG | 266645 | 0.0 | 34.61659 | 9 |
GAGCGTC | 300005 | 0.0 | 30.781746 | 8 |
ATCGGAA | 322055 | 0.0 | 29.773401 | 1 |
TCGGAAG | 341295 | 0.0 | 27.95952 | 2 |
AGAGCGT | 337285 | 0.0 | 27.482767 | 7 |
CGGAAGA | 362545 | 0.0 | 26.219885 | 3 |
AAGAGCG | 394795 | 0.0 | 23.999197 | 6 |
GAAGAGC | 666680 | 0.0 | 14.635511 | 5 |
CGCCGTA | 172605 | 0.0 | 10.718275 | 55-59 |
TCGCCGT | 181205 | 0.0 | 10.154377 | 55-59 |
GCCGTAT | 190030 | 0.0 | 9.860579 | 55-59 |
GGAAGAG | 1029785 | 0.0 | 9.801881 | 4 |
GTCGCCG | 204395 | 0.0 | 9.31086 | 55-59 |
CCGTATC | 199860 | 0.0 | 9.212367 | 55-59 |
TGGTCGC | 212780 | 0.0 | 9.07203 | 50-54 |
CGTATCA | 206205 | 0.0 | 9.033806 | 60-64 |
GGTCGCC | 223355 | 0.0 | 8.598363 | 50-54 |
GTGGTCG | 228770 | 0.0 | 8.522226 | 50-54 |
GCGTCGT | 255750 | 0.0 | 7.3161306 | 10-14 |
CGTCGTG | 259855 | 0.0 | 7.247425 | 10-14 |