Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005181398 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 14163594 |
Sequences flagged as poor quality | 0 |
Sequence length | 150 |
%GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACAACCGAGAATCTCGTAT | 17838 | 0.1259426103289885 | TruSeq Adapter, Index 1 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGTGGTA | 19500 | 0.0 | 85.6532 | 5 |
CAGTGGT | 21455 | 0.0 | 77.8814 | 4 |
GTGGTAT | 22525 | 0.0 | 74.213776 | 6 |
AGCAGTG | 23075 | 0.0 | 72.480934 | 2 |
GGTATCA | 24745 | 0.0 | 67.759575 | 8 |
TGGTATC | 24940 | 0.0 | 67.11431 | 7 |
GTATCAA | 25240 | 0.0 | 66.57331 | 9 |
AAGCAGT | 26065 | 0.0 | 63.94543 | 1 |
GCAGTGG | 31890 | 0.0 | 52.58129 | 3 |
TCTTGGG | 19715 | 0.0 | 16.521896 | 35-39 |
CAACGCA | 21540 | 0.0 | 16.209984 | 10-14 |
GCAGAGT | 21825 | 0.0 | 15.562835 | 15-19 |
GAGCACA | 20285 | 0.0 | 14.835081 | 9 |
GATCGGA | 20450 | 0.0 | 14.71632 | 1 |
AACGCAG | 23635 | 0.0 | 14.651294 | 10-14 |
CCACTCA | 5015 | 0.0 | 14.499035 | 9 |
AGAGCAC | 20635 | 0.0 | 14.443903 | 8 |
TATCAAC | 24590 | 0.0 | 14.275512 | 10-14 |
TCAACGC | 25755 | 0.0 | 13.652136 | 10-14 |
ATCAACG | 26270 | 0.0 | 13.411904 | 10-14 |