Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005002686 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 390477635 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGGCTATAGTGTAGATCTC | 2037067 | 0.5216859603239504 | Illumina Single End PCR Primer 1 (96% over 33bp) |
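
The Percentage value in the row above appears to be Count divided by Total Sequences (from Basic Statistics), multiplied by 100. A minimal Python check of that arithmetic, using only figures from this report; the variable names are illustrative and the formula is an assumption inferred from the numbers, not taken from the tool's documentation:

```python
# Values copied from this report.
total_sequences = 390477635        # Basic Statistics: Total Sequences
overrepresented_count = 2037067    # Overrepresented sequences: Count

# Assumption: Percentage = Count / Total Sequences * 100.
percentage = overrepresented_count / total_sequences * 100

# Compare against the Percentage reported in the table.
reported_percentage = 0.5216859603239504
print(abs(percentage - reported_percentage) < 1e-9)
```

Under that assumption, roughly 0.52% of all reads begin with this adapter-like sequence.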
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGCGTCG | 380760 | 0.0 | 83.91854 | 9 |
GAGCGTC | 409490 | 0.0 | 78.12361 | 8 |
ATCGGAA | 429715 | 0.0 | 75.363396 | 1 |
TCGGAAG | 446915 | 0.0 | 72.4478 | 2 |
AGAGCGT | 444725 | 0.0 | 71.97667 | 7 |
CGGAAGA | 467530 | 0.0 | 69.17158 | 3 |
AAGAGCG | 493350 | 0.0 | 65.4305 | 6 |
GAAGAGC | 741200 | 0.0 | 43.90064 | 5 |
GGAAGAG | 1076855 | 0.0 | 30.529722 | 4 |
CGCCGTA | 304725 | 0.0 | 21.171707 | 55-59 |
TCGCCGT | 311200 | 0.0 | 20.623585 | 55-59 |
GCCGTAT | 322430 | 0.0 | 20.061304 | 55-59 |
GTCGCCG | 333700 | 0.0 | 19.555384 | 55-59 |
TGGTCGC | 344415 | 0.0 | 19.080112 | 50-54 |
CGTATCA | 338100 | 0.0 | 18.950687 | 60-64 |
CCGTATC | 331655 | 0.0 | 18.900936 | 55-59 |
GTGGTCG | 359645 | 0.0 | 18.345083 | 50-54 |
GGTCGCC | 350420 | 0.0 | 18.279848 | 50-54 |
GCGTCGT | 369375 | 0.0 | 17.72656 | 10-14 |
CGTCGTG | 372645 | 0.0 | 17.598242 | 10-14 |
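
The nine most enriched k-mers above peak at positions 1 to 9, which suggests they may all come from the start of the overrepresented sequence reported earlier. The sketch below checks this by locating each k-mer in that 50-base sequence and comparing its 1-based offset with the Max Obs/Exp Position column. The helper `first_position` is a hypothetical name introduced here, and a match is only consistent with a shared adapter origin, not proof of it.

```python
# Overrepresented sequence and k-mer positions copied from this report.
overrepresented = "ATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGGCTATAGTGTAGATCTC"

# k-mer -> reported "Max Obs/Exp Position" (single-position rows only).
reported_positions = {
    "AGCGTCG": 9, "GAGCGTC": 8, "ATCGGAA": 1,
    "TCGGAAG": 2, "AGAGCGT": 7, "CGGAAGA": 3,
    "AAGAGCG": 6, "GAAGAGC": 5, "GGAAGAG": 4,
}

def first_position(kmer, sequence):
    """Return the 1-based position of the first occurrence, or None."""
    index = sequence.find(kmer)
    return index + 1 if index >= 0 else None

# Offsets of each k-mer within the overrepresented sequence.
found_positions = {
    kmer: first_position(kmer, overrepresented)
    for kmer in reported_positions
}

# True when every k-mer sits at its reported position.
all_match = found_positions == reported_positions
print(all_match)

# A k-mer from the 55-59 group does not occur in the 50 bases shown,
# so its source cannot be confirmed from this report alone.
print(first_position("CGCCGTA", overrepresented))
```

The k-mers peaking at positions 50 to 64 lie beyond the 50 bases reported for the overrepresented sequence, so this check says nothing about their origin.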