Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004684180 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 28951469 |
Sequences flagged as poor quality | 0 |
Sequence length | 35-76 |
%GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATGC | 1914086 | 6.611360549614943 | TruSeq Adapter, Index 9 (100% over 50bp) |
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN | 874390 | 3.0201921705596355 | No Hit |
AATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATGC | 76944 | 0.26576889759894395 | TruSeq Adapter, Index 9 (98% over 50bp) |
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG | 31157 | 0.10761802794877179 | No Hit |
AAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATGCCGTCTT | 29949 | 0.10344552810083663 | TruSeq Adapter, Index 9 (100% over 50bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AAAAGGG | 234810 | 0.0 | 85.639145 | 70 |
GATCGGA | 219530 | 0.0 | 68.48645 | 1 |
ATCGGAA | 234265 | 0.0 | 68.130356 | 2 |
TCGGAAG | 234975 | 0.0 | 67.71669 | 3 |
CGGAAGA | 237960 | 0.0 | 66.968575 | 4 |
ACACGTC | 246020 | 0.0 | 65.184555 | 13 |
TCGTATG | 244795 | 0.0 | 64.712234 | 43 |
ACGTCTG | 246800 | 0.0 | 64.71207 | 15 |
CACGTCT | 247500 | 0.0 | 64.64587 | 14 |
CTCGTAT | 238630 | 0.0 | 64.585 | 42 |
GTCACGA | 246285 | 0.0 | 64.57939 | 29 |
CACACGT | 248560 | 0.0 | 64.57742 | 12 |
GCACACG | 249225 | 0.0 | 64.45858 | 11 |
AGTCACG | 247040 | 0.0 | 64.447 | 28 |
TCACGAT | 247530 | 0.0 | 64.25945 | 30 |
CACGATC | 247350 | 0.0 | 64.24422 | 31 |
ACGATCA | 247920 | 0.0 | 64.20564 | 32 |
CGATCAG | 248055 | 0.0 | 64.132706 | 33 |
TCTCGTA | 238000 | 0.0 | 64.11581 | 41 |
ATCTCGT | 235920 | 0.0 | 63.98327 | 40 |