Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004913396 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18033202 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 46 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGCTGCGACATCTGTCACCCCATTGATCGCCAGGGTTGATTCGGCTGATC | 88522 | 0.49088342713623456 | No Hit |
GGCTGGCTAGGCGGGTGTCCCCTTCCTCCCTCACCGCTCCATGTGCGTCC | 67698 | 0.37540753993661247 | No Hit |
GAACCCGACTCCCTTTCGATCGGCCGAGGGCAACGGAGGCCATCGCCCGT | 31473 | 0.17452807327284417 | No Hit |
GGCTAGGCGGGTGTCCCCTTCCTCCCTCACCGCTCCATGTGCGTCCCTCC | 24087 | 0.13357028884831434 | No Hit |
CTCAGGACCGACTGACCCATGTTCAACTGCTGTTCACATGGAAGCCTTCT | 21279 | 0.11799901093549553 | No Hit |
AGCCCTTAGAGCCAATCCTTATCCCGAAGTTACGGATCCGGCTTGCCGAC | 20729 | 0.11494908114487931 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TGCGACA | 11155 | 0.0 | 78.33394 | 4 |
GCGACAT | 11295 | 0.0 | 77.61528 | 5 |
CTGCGAC | 12280 | 0.0 | 71.544304 | 3 |
GCTGCGA | 13365 | 0.0 | 65.94939 | 2 |
CGACATC | 14675 | 0.0 | 59.771 | 6 |
GGCTGCG | 15985 | 0.0 | 55.612965 | 1 |
ACCCGAC | 6635 | 0.0 | 49.315113 | 3 |
CCCGACT | 7105 | 0.0 | 46.052887 | 4 |
CCGACTC | 7195 | 0.0 | 45.608833 | 5 |
CGACTCC | 7455 | 0.0 | 43.763374 | 6 |
GACATCT | 21170 | 0.0 | 41.567844 | 7 |
ACATCTG | 21300 | 0.0 | 41.29391 | 8 |
AACCCGA | 8745 | 0.0 | 37.742157 | 2 |
CATCTGT | 24200 | 0.0 | 36.32584 | 9 |
TTCGGCT | 12545 | 0.0 | 35.035625 | 40-41 |
ATTCGGC | 12580 | 0.0 | 34.957027 | 38-39 |
TGATCGC | 12860 | 0.0 | 34.233074 | 24-25 |
TGATTCG | 12940 | 0.0 | 33.874397 | 36-37 |
TCGGCTG | 13025 | 0.0 | 33.653336 | 40-41 |
GAACCCG | 10535 | 0.0 | 33.111683 | 1 |