Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005330936 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 23852942 |
Sequences flagged as poor quality | 0 |
Sequence length | 101 |
%GC | 47 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCGGTGGTCGCC | 347953 | 1.4587424897104935 | Illumina Single End PCR Primer 1 (100% over 50bp) |
GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCGGTGGTCGCCG | 89030 | 0.3732453631925152 | Illumina Single End PCR Primer 1 (100% over 50bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGGA | 16130 | 0.0 | 67.89744 | 2 |
GATCGGG | 16630 | 0.0 | 66.62088 | 1 |
GAGCGGC | 18580 | 0.0 | 54.25575 | 9 |
CGGGAGA | 22320 | 0.0 | 51.52316 | 4 |
AGAGCGG | 20760 | 0.0 | 51.31907 | 8 |
GAGAGCG | 15660 | 0.0 | 47.028606 | 7 |
TCGGGAG | 27645 | 0.0 | 41.177372 | 3 |
CGCCGGA | 14150 | 0.0 | 37.86557 | 46-47 |
CCGGATC | 16100 | 0.0 | 36.962727 | 48-49 |
GGCGCCG | 23460 | 0.0 | 36.513893 | 44-45 |
TCGGGGG | 41070 | 0.0 | 36.169018 | 38-39 |
GCCGGAT | 16535 | 0.0 | 34.57928 | 46-47 |
TCTCGGG | 25670 | 0.0 | 34.00939 | 36-37 |
AGCGGCG | 16030 | 0.0 | 33.799828 | 10-11 |
GGGTCGC | 11645 | 0.0 | 32.985706 | 42-43 |
CATTAAA | 137145 | 0.0 | 32.700005 | 54-55 |
ATCATTA | 129980 | 0.0 | 31.869818 | 52-53 |
GCGCCGG | 12940 | 0.0 | 30.50171 | 44-45 |
GTATCAT | 113980 | 0.0 | 29.926779 | 50-51 |
GGGCGCC | 33990 | 0.0 | 29.843027 | 42-43 |