Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005100951 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 46847048 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGACGTCATATCTCGTATG | 132379 | 0.28257703665767797 | TruSeq Adapter, Index 6 (97% over 35bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 49505 | 0.0 | 42.696903 | 1 |
TCGGAAG | 50225 | 0.0 | 42.224556 | 2 |
CGGAAGA | 52015 | 0.0 | 40.79914 | 3 |
AGAGCAC | 75535 | 0.0 | 28.322676 | 7 |
AGCACAC | 78425 | 0.0 | 27.29778 | 9 |
GAGCACA | 80325 | 0.0 | 26.660791 | 8 |
GAAGAGC | 83700 | 0.0 | 25.559422 | 5 |
TATGCCG | 23420 | 0.0 | 20.603897 | 45-49 |
AAGAGCA | 107290 | 0.0 | 20.210194 | 6 |
CTCGTAT | 24810 | 0.0 | 19.50818 | 40-44 |
CGACGTC | 24580 | 0.0 | 19.48398 | 30-34 |
ACGACGT | 24720 | 0.0 | 19.226997 | 30-34 |
CGTATGC | 24720 | 0.0 | 19.209486 | 45-49 |
TCTCGTA | 24665 | 0.0 | 19.105545 | 40-44 |
ATGCCGT | 25830 | 0.0 | 18.535557 | 45-49 |
CACGACG | 25830 | 0.0 | 18.23796 | 30-34 |
TATCTCG | 25795 | 0.0 | 17.886356 | 40-44 |
GCCGTCT | 26340 | 0.0 | 17.786274 | 50-54 |
GGAAGAG | 122090 | 0.0 | 17.761677 | 4 |
CGTCATA | 26860 | 0.0 | 17.640776 | 35-39 |