Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004723680 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 18546937 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 43 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CTGGAGTCTTGGAAGCTTGACTACCCTACGTTCTCCTACAAATGGACCTT | 106457 | 0.5739869607579947 | No Hit |
GGAAGCTTGACTACCCTACGTTCTCCTACAAATGGACCTTGAGAGCTTGT | 54366 | 0.2931265685541499 | No Hit |
CTTGGAAGCTTGACTACCCTACGTTCTCCTACAAATGGACCTTGAGAGCT | 46659 | 0.2515725372874238 | No Hit |
GGAGTCTTGGAAGCTTGACTACCCTACGTTCTCCTACAAATGGACCTTGA | 36032 | 0.1942746664853609 | No Hit |
GTCTTGGAAGCTTGACTACCCTACGTTCTCCTACAAATGGACCTTGAGAG | 31482 | 0.16974231378475055 | No Hit |
GGCTGGAGTGCAGTGGCTATTCACAGGCGCGATCCCACTACTGATCAGCA | 20325 | 0.10958682827250667 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGCTATG | 2535 | 0.0 | 37.623753 | 1 |
ACCGCGT | 970 | 0.0 | 36.292473 | 6 |
CCGCGTT | 1020 | 0.0 | 34.513336 | 7 |
TGGAGTC | 32295 | 0.0 | 32.413353 | 2 |
GGAGTCT | 34175 | 0.0 | 30.696758 | 3 |
GACCGCG | 1145 | 0.0 | 30.335726 | 5 |
TGCGCGG | 2020 | 0.0 | 28.813509 | 7 |
GAGTCTT | 36875 | 0.0 | 28.653027 | 4 |
GTGCGCG | 2090 | 0.0 | 27.84854 | 6 |
AGTCTTG | 40540 | 0.0 | 26.560598 | 5 |
CGCGGAC | 2205 | 0.0 | 26.395979 | 9 |
CGCGTTC | 1345 | 0.0 | 26.173683 | 8 |
CTGGAGT | 47740 | 0.0 | 24.987587 | 1 |
GTCTTGG | 43040 | 0.0 | 24.614233 | 6 |
TCTTGGA | 48765 | 0.0 | 22.080608 | 7 |
TCGCGCG | 1200 | 0.0 | 20.339806 | 9 |
TTGGAAG | 51910 | 0.0 | 20.218338 | 9 |
GCGTAAC | 2220 | 0.0 | 20.200354 | 26-27 |
CTTGGAA | 52590 | 0.0 | 20.162245 | 8 |
GCGCGGA | 3150 | 0.0 | 18.924265 | 8 |