Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004961646 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 35725660 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGAAACGATCTCGTAT | 79769 | 0.223282089120257 | TruSeq Adapter, Index 19 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACGTGAAACGATCTCGTA | 74287 | 0.20793737610445825 | TruSeq Adapter, Index 19 (97% over 40bp) |
CTCAGATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAACGGTAA | 46510 | 0.13018653819131684 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCGTAT | 9960 | 0.0 | 36.89769 | 44 |
AGATCGG | 12565 | 0.0 | 28.84168 | 1 |
TTGAACG | 7580 | 0.0 | 27.369097 | 7 |
TCGAACG | 8165 | 0.0 | 24.598444 | 40 |
GAACGCT | 8715 | 0.0 | 23.273975 | 9 |
ACGGTAA | 8620 | 0.0 | 23.25697 | 44 |
AACGCTG | 8820 | 0.0 | 22.872192 | 10 |
TCGACTA | 3200 | 0.0 | 22.487427 | 44 |
GTCGAAC | 9190 | 0.0 | 21.974562 | 39 |
AACGGTA | 9155 | 0.0 | 21.842312 | 43 |
TATGCCG | 3210 | 0.0 | 21.660362 | 3 |
CATACGA | 2285 | 0.0 | 21.468904 | 15 |
CGAACGG | 9445 | 0.0 | 21.218246 | 41 |
GAACGGT | 9520 | 0.0 | 21.074194 | 42 |
CGTATGC | 3350 | 0.0 | 21.043709 | 1 |
AGTCGAA | 9610 | 0.0 | 21.014177 | 38 |
TGAACGC | 9815 | 0.0 | 20.91266 | 8 |
GCCTAAC | 9660 | 0.0 | 20.791456 | 24 |
ATTGAAC | 10975 | 0.0 | 20.506384 | 6 |
GCATACG | 2525 | 0.0 | 19.776787 | 14 |