Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005002587 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 386638959 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 40 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAATTCGTATCTCGTATG | 2610133 | 0.6750827714700112 | TruSeq Adapter, Index 7 (97% over 34bp) |
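
The Percentage value above appears to be the sequence Count divided by Total Sequences from Basic Statistics. A minimal sketch of that arithmetic, assuming this relationship holds (the variable names are illustrative and not part of the report):

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 386638959
adapter_count = 2610133

# Assumption: Percentage = Count / Total Sequences * 100.
percentage = adapter_count / total_sequences * 100
print(f"{percentage:.4f}%")
```

The result agrees with the reported 0.675%, so roughly 1 read in 148 begins with this TruSeq adapter sequence.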
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 430920 | 0.0 | 108.43996 | 1 |
TCGGAAG | 446230 | 0.0 | 104.570145 | 2 |
CGGAAGA | 467770 | 0.0 | 99.5425 | 3 |
AGAGCAC | 691450 | 0.0 | 67.41771 | 7 |
AGCACAC | 722830 | 0.0 | 64.475494 | 9 |
GAAGAGC | 733375 | 0.0 | 63.71427 | 5 |
GAGCACA | 732795 | 0.0 | 63.553444 | 8 |
AAGAGCA | 950365 | 0.0 | 49.257557 | 6 |
GGAAGAG | 1039425 | 0.0 | 45.21882 | 4 |
TATGCCG | 386890 | 0.0 | 24.558395 | 45-49 |
CTCGTAT | 351265 | 0.0 | 23.964172 | 40-44 |
TCTCGTA | 353335 | 0.0 | 23.79916 | 40-44 |
TCGTATC | 401015 | 0.0 | 23.701132 | 35-39 |
CGTATGC | 398920 | 0.0 | 23.54593 | 45-49 |
ATGCCGT | 403755 | 0.0 | 23.269712 | 45-49 |
TATCTCG | 357965 | 0.0 | 23.15839 | 40-44 |
GTCACGA | 420295 | 0.0 | 22.743511 | 25-29 |
GCCGTCT | 406165 | 0.0 | 22.707632 | 50-54 |
ATCTCGT | 372280 | 0.0 | 22.54403 | 40-44 |
TGCCGTC | 404270 | 0.0 | 22.536747 | 45-49 |
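
Many of the enriched k-mers above look like substrings of the overrepresented adapter sequence, at offsets that match the reported Max Obs/Exp Position. The sketch below checks this for a few rows. It assumes the enrichment comes from that adapter, which the report itself does not state, and the helper `in_reported_range` is a hypothetical name introduced only for this illustration:

```python
# Overrepresented sequence copied from the report (50 bp).
adapter = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACGAATTCGTATCTCGTATG"

# (k-mer -> reported "Max Obs/Exp Position") for a few rows of the Kmer Content table.
reported = {
    "ATCGGAA": "1",
    "GGAAGAG": "4",
    "AGCACAC": "9",
    "GTCACGA": "25-29",
    "TCGTATC": "35-39",
    "TATCTCG": "40-44",
}

def in_reported_range(position, reported_range):
    """Return True if a 1-based position falls in an 'N' or 'N-M' range string."""
    low, _, high = reported_range.partition("-")
    return int(low) <= position <= int(high or low)

matches = {}
for kmer, reported_range in reported.items():
    index = adapter.find(kmer)  # 0-based offset of first occurrence; -1 if absent
    position = index + 1 if index >= 0 else None  # convert to 1-based
    matches[kmer] = position is not None and in_reported_range(position, reported_range)
    print(kmer, position, reported_range, matches[kmer])
```

K-mers near the end of the table, such as TATGCCG and GCCGTCT, extend past the 50 bp overrepresented sequence, so a substring check like this one would not find them.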