Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004913508 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 13228680 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 44 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GGCTGCGACATCTGTCACCCCATTGATCGCCAGGGTTGATTCGGCTGATC | 21761 | 0.16449864990308935 | No Hit |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCACGATATCTCGTATG | 14317 | 0.10822697351512017 | TruSeq Adapter, Index 7 (97% over 35bp) |
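
The Percentage column above appears to be each sequence's Count expressed as a share of Total Sequences (13228680, from Basic Statistics). The sketch below checks that reading against the two rows of the table. The names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage_of_total` are illustrative assumptions and are not part of the report or of any tool's API.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 13228680

# Hypothetical container: sequence -> (count, percentage as reported).
overrepresented = {
    "GGCTGCGACATCTGTCACCCCATTGATCGCCAGGGTTGATTCGGCTGATC": (21761, 0.16449864990308935),
    "AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCACGATATCTCGTATG": (14317, 0.10822697351512017),
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of the total number of sequences."""
    return 100.0 * count / total


for sequence, (count, reported) in overrepresented.items():
    computed = percentage_of_total(count)
    # Compare the recomputed value with the reported one, allowing for rounding.
    matches = abs(computed - reported) < 1e-9
    print(f"{sequence[:12]}...  count={count}  computed={computed:.6f}  matches={matches}")
```

Under this assumption both rows reproduce the reported percentages, so the two flagged sequences together account for roughly 0.27% of all reads.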
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCGACCG | 1385 | 0.0 | 52.11938 | 3 |
TGCGACA | 4105 | 0.0 | 49.28353 | 4 |
GCGACAT | 4085 | 0.0 | 49.17606 | 5 |
CCGCTCG | 1760 | 0.0 | 41.28423 | 7 |
CTGCGAC | 4905 | 0.0 | 41.148624 | 3 |
CGACATC | 5130 | 0.0 | 39.158714 | 6 |
GCTGCGA | 5245 | 0.0 | 38.8434 | 2 |
GCTCGCG | 1935 | 0.0 | 38.286797 | 9 |
CGACCGC | 2020 | 0.0 | 36.205517 | 4 |
CTCGACC | 2220 | 0.0 | 32.51592 | 2 |
CTCGCGT | 1220 | 0.0 | 30.557318 | 10-11 |
CGCGTCG | 1235 | 0.0 | 30.186522 | 12-13 |
GGCTGCG | 7045 | 0.0 | 29.184555 | 1 |
CCTCGAC | 2555 | 0.0 | 28.686668 | 1 |
CGTCGTA | 930 | 0.0 | 28.599964 | 38-39 |
TCGCGTC | 1315 | 0.0 | 27.80804 | 10-11 |
GACCGCT | 2705 | 0.0 | 27.563719 | 5 |
CGTCGCA | 1360 | 0.0 | 27.062826 | 14-15 |
ACCCGAC | 1725 | 0.0 | 26.15411 | 3 |
ACGGACC | 2750 | 0.0 | 25.90383 | 4 |