Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00000818871 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 4000000 |
Sequences flagged as poor quality | 0 |
Sequence length | 100 |
%GC | 26 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACTTAGGCATCTCGTATGCC | 5342 | 0.13355 | TruSeq Adapter, Index 3 (100% over 50bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTAGCTTATCTCGTATGC | 5031 | 0.125775 | TruSeq Adapter, Index 10 (100% over 50bp) |
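
The Percentage column appears to be each sequence's Count expressed as a share of Total Sequences (4000000); both rows above are consistent with that reading. A minimal sketch of the arithmetic, assuming that definition; the names `TOTAL_SEQUENCES`, `overrepresented`, and `percentage_of_total` are illustrative and not part of the report:

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 4_000_000
overrepresented = {
    "TruSeq Adapter, Index 3": 5342,
    "TruSeq Adapter, Index 10": 5031,
}


def percentage_of_total(count, total):
    """Return count as a percentage of total (assumed definition of the column)."""
    return count / total * 100


# Recompute the Percentage column for each overrepresented sequence.
percentages = {
    name: percentage_of_total(count, TOTAL_SEQUENCES)
    for name, count in overrepresented.items()
}

for name, pct in percentages.items():
    print(f"{name}: {pct:.6f}")
```

Under this assumption the two adapter sequences together account for roughly 0.26% of all reads.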
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CAAATGC | 15 | 6.904703E-4 | 93.9958 | 94 |
TACTACA | 15 | 6.904703E-4 | 93.9958 | 94 |
AACACAT | 35 | 2.939043E-4 | 53.711884 | 94 |
CACTAGT | 20 | 5.681411E-4 | 46.99555 | 30-31 |
ACTAGTT | 20 | 5.681411E-4 | 46.99555 | 32-33 |
TACTAGC | 40 | 4.649337E-9 | 46.99555 | 30-31 |
CTAGCTT | 810 | 0.0 | 46.41536 | 32-33 |
GTCACTA | 795 | 0.0 | 46.40441 | 28-29 |
CACTAGC | 765 | 0.0 | 46.381226 | 30-31 |
TATCTCG | 860 | 0.0 | 45.90263 | 38-39 |
CTTATCT | 835 | 0.0 | 45.86991 | 36-37 |
AGCTTAT | 840 | 0.0 | 45.59687 | 34-35 |
ACTAGCT | 805 | 0.0 | 43.20088 | 32-33 |
TCACTAG | 790 | 0.0 | 43.12883 | 30-31 |
GCTTATC | 820 | 0.0 | 42.697174 | 36-37 |
CATCTCG | 1065 | 0.0 | 42.582825 | 38-39 |
ACAGTCA | 205 | 0.0 | 42.410618 | 32-33 |
TCACAGT | 200 | 0.0 | 42.295994 | 30-31 |
TTATCTC | 865 | 0.0 | 42.10584 | 38-39 |
CTGCTTT | 95 | 0.0 | 42.04865 | 56-57 |