Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005104636 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 51161949 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 48 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACCTGAAGCTATCTCGTATGC | 339195 | 0.6629829524281806 | TruSeq Adapter, Index 19 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGTATGC | 40575 | 0.0 | 41.381966 | 45 |
TCGTATG | 41340 | 0.0 | 40.75297 | 44 |
ACACGTC | 43490 | 0.0 | 39.173645 | 12 |
CTCGTAT | 42935 | 0.0 | 39.066147 | 43 |
ACGTCTG | 43600 | 0.0 | 38.9871 | 14 |
TATCTCG | 40200 | 0.0 | 38.827053 | 40 |
ATCGGAA | 44790 | 0.0 | 38.283688 | 1 |
TCGGAAG | 46070 | 0.0 | 37.64653 | 2 |
TCTCGTA | 41840 | 0.0 | 37.330647 | 42 |
CGTCTGA | 45565 | 0.0 | 37.30631 | 15 |
ATCTCGT | 42085 | 0.0 | 37.189537 | 41 |
GAAGCTA | 45435 | 0.0 | 37.067253 | 35 |
CGGAAGA | 46845 | 0.0 | 36.87762 | 3 |
CACGTCT | 46260 | 0.0 | 36.842556 | 13 |
CACACGT | 46380 | 0.0 | 36.824802 | 11 |
GCACACG | 47120 | 0.0 | 36.289455 | 10 |
GCTATCT | 43240 | 0.0 | 36.13351 | 38 |
AGCTATC | 43665 | 0.0 | 35.822746 | 37 |
CTATCTC | 44645 | 0.0 | 35.087265 | 39 |
ACCTGAA | 50660 | 0.0 | 33.40171 | 31 |