Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005000743 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 55366749 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTGGAGCTGATCTCGTAT | 122453 | 0.2211670401670143 | TruSeq Adapter, Index 10 (97% over 36bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 86045 | 0.0 | 24.16767 | 1 |
TCGGAAG | 88150 | 0.0 | 23.652971 | 3 |
ATCGGAA | 88900 | 0.0 | 23.537743 | 2 |
CGGAAGA | 88810 | 0.0 | 23.493517 | 4 |
TCCGATC | 5050 | 0.0 | 21.390158 | 3 |
AGAGCAC | 111965 | 0.0 | 18.783833 | 8 |
GAGCACA | 115720 | 0.0 | 18.21261 | 9 |
TATGCCG | 29735 | 0.0 | 17.676352 | 45-49 |
GAAGAGC | 122090 | 0.0 | 17.368603 | 6 |
CGTATGC | 32330 | 0.0 | 16.037785 | 45-49 |
CTCGTAT | 32325 | 0.0 | 15.972936 | 40-44 |
ATGCCGT | 32640 | 0.0 | 15.881023 | 45-49 |
TCTCGTA | 31195 | 0.0 | 15.649823 | 40-44 |
GCCGTCT | 33340 | 0.0 | 15.334414 | 50-54 |
CCGATCT | 7270 | 0.0 | 14.559201 | 4 |
AAGAGCA | 147150 | 0.0 | 14.514136 | 7 |
TTCCGAT | 7675 | 0.0 | 14.175345 | 2 |
CGTCTTC | 36855 | 0.0 | 14.025346 | 50-54 |
ATCTCGT | 34445 | 0.0 | 13.987995 | 40-44 |
CCGTCTT | 37475 | 0.0 | 13.750746 | 50-54 |