Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005416774 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 12553031 |
Sequences flagged as poor quality | 0 |
Sequence length | 125 |
%GC | 32 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CNGGCGCGGTGGTTTACGTTTGTAATTTTAGTATTTTGGGAGGTCGAGGC | 46339 | 0.36914590587723395 | No Hit |
CNGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATCGTGTTAGTT | 23486 | 0.18709425635928087 | No Hit |
GNTCGGAAGAGCACACGTCTGAACTCCAGTCACACAGTGATCTCGTATGC | 22992 | 0.18315895181012457 | TruSeq Adapter, Index 5 (98% over 50bp) |
CNGTTAATTTTTTGTATTTTTAGTAGAGACGGGGTTTTATTTTGTTAGTT | 13724 | 0.10932817739396963 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGCGCGG | 27390 | 0.0 | 97.86752 | 3 |
CGCGGTG | 28590 | 0.0 | 94.066505 | 5 |
GCGCGGT | 28165 | 0.0 | 93.36863 | 4 |
CGGTGGT | 49310 | 0.0 | 91.17644 | 7 |
CAGAAGA | 1460 | 0.0 | 88.09701 | 4 |
GCTAATT | 525 | 0.0 | 87.97497 | 3 |
GCGGTGG | 40840 | 0.0 | 87.88266 | 6 |
GCAGAAG | 1610 | 0.0 | 77.71041 | 3 |
ACGGTGG | 13370 | 0.0 | 73.02581 | 6 |
GGTGCGG | 11375 | 0.0 | 71.80196 | 3 |
GGTACGG | 13965 | 0.0 | 70.12376 | 3 |
TAAGCGA | 9135 | 0.0 | 69.952446 | 7 |
CGTAGTG | 16210 | 0.0 | 69.93364 | 5 |
AAGCGAT | 9470 | 0.0 | 68.218735 | 8 |
GGCGTGG | 23130 | 0.0 | 67.7661 | 3 |
GGCGTAG | 16995 | 0.0 | 67.11621 | 3 |
GAAGATC | 3545 | 0.0 | 66.79287 | 6 |
GGAGGCG | 24515 | 0.0 | 65.82161 | 3 |
CGTGGTG | 23475 | 0.0 | 65.10155 | 5 |
GGCGGAG | 24625 | 0.0 | 65.02901 | 6 |