Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004973512 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 459765713 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACTCCGCGAAATCTCGTATG | 1645525 | 0.35790511416409165 | TruSeq Adapter, Index 6 (97% over 35bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 287000 | 0.0 | 98.44757 | 1 |
TCGGAAG | 304395 | 0.0 | 92.7798 | 2 |
CGGAAGA | 326750 | 0.0 | 86.41149 | 3 |
AGAGCAC | 591085 | 0.0 | 47.94114 | 7 |
AGCACAC | 624365 | 0.0 | 45.402027 | 9 |
GAGCACA | 637175 | 0.0 | 44.518826 | 8 |
GAAGAGC | 647150 | 0.0 | 43.977146 | 5 |
AAGAGCA | 904725 | 0.0 | 31.698807 | 6 |
GGAAGAG | 1014560 | 0.0 | 28.396717 | 4 |
TCCGCGA | 223185 | 0.0 | 25.446198 | 30-34 |
CGCGAAA | 221785 | 0.0 | 25.355038 | 35-39 |
CCGCGAA | 226780 | 0.0 | 24.680931 | 30-34 |
TATGCCG | 238945 | 0.0 | 23.943556 | 45-49 |
CTCGTAT | 231215 | 0.0 | 22.503426 | 40-44 |
CGTATGC | 252935 | 0.0 | 22.353241 | 45-49 |
CTCCGCG | 255910 | 0.0 | 22.300434 | 30-34 |
GCGAAAT | 255605 | 0.0 | 22.266844 | 35-39 |
TCTCGTA | 235835 | 0.0 | 22.034304 | 40-44 |
ACTCCGC | 267285 | 0.0 | 21.377962 | 30-34 |
ATGCCGT | 268385 | 0.0 | 21.29226 | 45-49 |