Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005110979 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 67748315 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACTCGTGCATATCTCGTATG | 95056 | 0.14030754860840453 | TruSeq Adapter, Index 16 (97% over 36bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTCGTGCATATCTCGTAT | 79429 | 0.11724129227420638 | TruSeq Adapter, Index 16 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 30905 | 0.0 | 40.93623 | 1 |
ATCGGAA | 43750 | 0.0 | 37.065422 | 1 |
TCGGAAG | 45550 | 0.0 | 35.688213 | 2 |
CGGAAGA | 48810 | 0.0 | 33.35322 | 3 |
TATGCCG | 27495 | 0.0 | 21.340889 | 45-49 |
CTCGTAT | 29365 | 0.0 | 19.910397 | 40-44 |
TCTCGTA | 28270 | 0.0 | 19.373611 | 40-44 |
CGTATGC | 29825 | 0.0 | 19.28955 | 45-49 |
AGAGCAC | 87955 | 0.0 | 18.9303 | 7 |
ATGCCGT | 31335 | 0.0 | 18.484968 | 45-49 |
TCGTGCA | 32155 | 0.0 | 18.220339 | 30-34 |
AGCACAC | 92375 | 0.0 | 18.1265 | 9 |
TATCTCG | 29875 | 0.0 | 17.905655 | 40-44 |
GAGCACA | 95110 | 0.0 | 17.506199 | 8 |
GCCGTCT | 32100 | 0.0 | 17.399143 | 50-54 |
GAAGAGC | 98210 | 0.0 | 17.048773 | 5 |
ATCTCGT | 32205 | 0.0 | 16.83983 | 40-44 |
ACTCGTG | 35850 | 0.0 | 16.204895 | 30-34 |
CACTCGT | 35545 | 0.0 | 16.193027 | 30-34 |
CTCGTGC | 37530 | 0.0 | 15.761503 | 30-34 |