Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00003249711 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 390803091 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCGGTGGTCGCCGT | 884028 | 0.22620803682435564 | Illumina Single End PCR Primer 1 (100% over 50bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGCGTCG | 297420 | 0.0 | 48.26773 | 9 |
GAGCGTC | 332170 | 0.0 | 43.387238 | 8 |
AGAGCGT | 367550 | 0.0 | 39.35407 | 7 |
ATCGGAA | 385700 | 0.0 | 38.923183 | 1 |
TCGGAAG | 394720 | 0.0 | 37.86902 | 2 |
CGGAAGA | 412410 | 0.0 | 36.13048 | 3 |
AAGAGCG | 412525 | 0.0 | 35.753395 | 6 |
GAAGAGC | 665055 | 0.0 | 22.622898 | 5 |
CGCCGTA | 167265 | 0.0 | 17.993208 | 45-49 |
GTCGCCG | 189575 | 0.0 | 16.353668 | 40-44 |
GCCGTAT | 188345 | 0.0 | 16.131016 | 45-49 |
TCGCCGT | 178295 | 0.0 | 15.917251 | 40-44 |
GGAAGAG | 998225 | 0.0 | 15.471798 | 4 |
CCGTATC | 197770 | 0.0 | 15.445112 | 45-49 |
CGTATCA | 204445 | 0.0 | 15.0408325 | 45-49 |
GGTCGCC | 209005 | 0.0 | 14.923543 | 40-44 |
TGGTCGC | 210845 | 0.0 | 14.669533 | 40-44 |
GTGGTCG | 225605 | 0.0 | 13.803619 | 40-44 |
TCGGTGG | 267225 | 0.0 | 11.970904 | 35-39 |
TCTCGGT | 265645 | 0.0 | 11.950414 | 35-39 |