Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005310486 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 10049906 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGCCG | 99062 | 0.9857007617782694 | TruSeq Adapter, Index 1 (100% over 51bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATCACGATCTCGTATGCC | 13808 | 0.13739431990707177 | TruSeq Adapter, Index 1 (100% over 51bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 12260 | 0.0 | 41.863132 | 45 |
GATCGGA | 1960 | 0.0 | 38.711006 | 1 |
ATCGGAA | 13735 | 0.0 | 38.06222 | 1 |
TCGGAAG | 13775 | 0.0 | 37.906555 | 2 |
CGGAAGA | 13950 | 0.0 | 37.33199 | 3 |
CTCGTAT | 13900 | 0.0 | 36.775448 | 41 |
CGTATGC | 13970 | 0.0 | 36.703915 | 43 |
CGTCTGA | 14040 | 0.0 | 36.568264 | 15 |
ACACGTC | 14050 | 0.0 | 36.55825 | 12 |
TCTCGTA | 13350 | 0.0 | 36.537624 | 40 |
TCGTATG | 14085 | 0.0 | 36.42021 | 42 |
ATCTCGT | 13395 | 0.0 | 36.314823 | 39 |
ACGTCTG | 14200 | 0.0 | 36.14038 | 14 |
TCACGAT | 13530 | 0.0 | 36.019176 | 34 |
CATCACG | 14245 | 0.0 | 35.869526 | 32 |
CACGTCT | 14395 | 0.0 | 35.6977 | 13 |
GCACACG | 14575 | 0.0 | 35.365772 | 10 |
CACACGT | 14590 | 0.0 | 35.28227 | 11 |
ACGATCT | 13740 | 0.0 | 35.239235 | 36 |
CACGATC | 13775 | 0.0 | 35.149876 | 35 |