Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004691363 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20428940 |
Sequences flagged as poor quality | 0 |
Sequence length | 51 |
%GC | 42 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATTGGCTCATCTCGTATG | 1606452 | 7.863609174044273 | TruSeq Adapter, Index 12 (97% over 37bp) |
ATCGGAAGAGCACACGTCTGAACTCCAGTCACATTGGCTCATCTCGTATGC | 374100 | 1.831225702361454 | TruSeq Adapter, Index 12 (97% over 36bp) |
GAAGAGCACACGTCTGAACTCCAGTCACATTGGCTCATCTCGTATGCCGTC | 70119 | 0.34323366753243195 | TruSeq Adapter, Index 12 (96% over 32bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACATTGGCTCACCTCGTATG | 20548 | 0.10058280067394589 | TruSeq Adapter, Index 12 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GATCGGA | 199455 | 0.0 | 40.096493 | 1 |
TACGGCG | 1700 | 0.0 | 38.903744 | 6 |
ACGGCGA | 1765 | 0.0 | 37.725933 | 7 |
CGGCGAC | 1820 | 0.0 | 36.215057 | 8 |
TGCCGTC | 10095 | 0.0 | 35.70875 | 45 |
ATGCCGT | 11470 | 0.0 | 34.01172 | 44 |
CGTATGC | 62810 | 0.0 | 33.739822 | 45 |
ATACGGC | 2010 | 0.0 | 33.57344 | 5 |
TCTCGTA | 230130 | 0.0 | 33.19017 | 43 |
ATCTCGT | 230570 | 0.0 | 33.132607 | 42 |
CATCTCG | 230635 | 0.0 | 33.113846 | 41 |
GGCGACC | 1965 | 0.0 | 33.084538 | 9 |
GCTCATC | 227660 | 0.0 | 33.074688 | 38 |
TCGTATG | 237070 | 0.0 | 33.055386 | 45 |
GCGACCA | 1955 | 0.0 | 33.02364 | 10 |
TCGGAAG | 242190 | 0.0 | 32.986176 | 3 |
CTCGTAT | 236855 | 0.0 | 32.983948 | 44 |
ATCGGAA | 242605 | 0.0 | 32.972443 | 2 |
CGGAAGA | 242160 | 0.0 | 32.943386 | 4 |
CGTCTGA | 240215 | 0.0 | 32.795986 | 16 |