Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005189345 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 11007862 |
Sequences flagged as poor quality | 0 |
Sequence length | 35-51 |
%GC | 42 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN | 2104002 | 19.113629876537335 | No Hit |
GGAAGAGCACACGTCTGAACTCCAGTCACCGCATACAATCTCGTATGCCGT | 36003 | 0.3270662368405418 | TruSeq Adapter, Index 22 (96% over 33bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ATCGGAA | 2770 | 0.0 | 40.183475 | 1 |
ATGCCGT | 7535 | 0.0 | 38.521935 | 45 |
TCGGAAG | 2875 | 0.0 | 36.678684 | 2 |
CGGAAGA | 2830 | 0.0 | 33.327778 | 3 |
TATGCCG | 8310 | 0.0 | 24.709764 | 44 |
CACACGT | 7930 | 0.0 | 22.861465 | 8 |
ACACGTC | 8115 | 0.0 | 22.806278 | 9 |
GCACACG | 8045 | 0.0 | 22.562319 | 7 |
CGTCTGA | 8620 | 0.0 | 21.650696 | 12 |
ACGTCTG | 8715 | 0.0 | 21.338118 | 11 |
CACGTCT | 8725 | 0.0 | 21.237293 | 10 |
CGTCGGA | 105 | 6.904884E-8 | 21.18521 | 15 |
GAAGAGC | 8650 | 0.0 | 21.141167 | 2 |
AGAGCAC | 8785 | 0.0 | 20.738104 | 4 |
GTATGCC | 9335 | 0.0 | 20.508715 | 43 |
GAGCACA | 8890 | 0.0 | 20.468143 | 5 |
CGTATGC | 9360 | 0.0 | 19.92495 | 42 |
GTCACCG | 9515 | 0.0 | 19.479275 | 25 |
GCACGTC | 585 | 0.0 | 19.392506 | 10 |
CCGCATA | 9860 | 0.0 | 18.799492 | 29 |