Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005186827 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 16394750 |
Sequences flagged as poor quality | 0 |
Sequence length | 150 |
%GC | 52 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGCAGAATCTCGTAT | 45202 | 0.2757102121105842 | TruSeq Adapter, Index 5 (97% over 37bp) |
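
The Percentage column appears to be the Count divided by Total Sequences from Basic Statistics. A minimal sketch of that arithmetic, assuming the percentage is taken over all reads in the file (the variable names are illustrative and not part of the report):

```python
# Values copied from the report above.
total_sequences = 16394750   # "Total Sequences" in Basic Statistics
adapter_count = 45202        # "Count" for the overrepresented sequence

# Assumption: the percentage is count / total reads, expressed in percent.
percentage = adapter_count / total_sequences * 100

print(f"{percentage:.4f}% of reads start with the adapter-like sequence")
```

Under that assumption the result agrees with the reported 0.2757102121105842, so roughly 1 read in 363 carries this sequence.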
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGTGAGT | 3970 | 0.0 | 38.419086 | 1 |
GTGAGTA | 4525 | 0.0 | 33.573772 | 2 |
TAATGGG | 5550 | 0.0 | 27.49967 | 9 |
GAGCACA | 27960 | 0.0 | 25.851223 | 9 |
GATCGGA | 28780 | 0.0 | 25.317736 | 1 |
AGAGCAC | 29965 | 0.0 | 24.265636 | 8 |
TCGGAAG | 30260 | 0.0 | 24.149872 | 3 |
ATCGGAA | 33255 | 0.0 | 22.235643 | 2 |
AAGAGCA | 39750 | 0.0 | 18.70888 | 7 |
GGAAGAG | 42050 | 0.0 | 18.216301 | 5 |
CGGAAGA | 40720 | 0.0 | 18.139065 | 4 |
GAAGAGC | 43005 | 0.0 | 17.460228 | 6 |
TGAGTAT | 10625 | 0.0 | 14.839966 | 3 |
GAGTATA | 10975 | 0.0 | 14.365307 | 4 |
AGTCACA | 11865 | 0.0 | 12.20777 | 25-29 |
CTCGTAT | 12605 | 0.0 | 11.560048 | 40-44 |
CACATTA | 6870 | 0.0 | 11.212737 | 6 |
TCACACA | 12845 | 0.0 | 11.10846 | 30-34 |
GTCACAC | 13240 | 0.0 | 11.005218 | 25-29 |
CAGTCAC | 13690 | 0.0 | 10.811745 | 25-29 |
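
Many of the enriched 7-mers above look like fragments of the overrepresented TruSeq adapter sequence. The sketch below checks this by plain substring search. It assumes the overrepresented sequence begins at read position 1, so a 1-based offset within it can be compared against the Max Obs/Exp Position column. The names `adapter`, `kmers`, `from_adapter`, and `other` are illustrative only.

```python
# Overrepresented sequence and k-mers copied from the tables above.
adapter = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACACAGCAGAATCTCGTAT"
kmers = [
    "AGTGAGT", "GTGAGTA", "TAATGGG", "GAGCACA", "GATCGGA",
    "AGAGCAC", "TCGGAAG", "ATCGGAA", "AAGAGCA", "GGAAGAG",
    "CGGAAGA", "GAAGAGC", "TGAGTAT", "GAGTATA", "AGTCACA",
    "CTCGTAT", "CACATTA", "TCACACA", "GTCACAC", "CAGTCAC",
]

# Map each k-mer found in the adapter to its 1-based start offset.
from_adapter = {
    k: adapter.find(k) + 1 for k in kmers if k in adapter
}
# K-mers with no exact match in the adapter need another explanation.
other = [k for k in kmers if k not in adapter]

for kmer, pos in sorted(from_adapter.items(), key=lambda kv: kv[1]):
    print(f"{kmer}  adapter offset {pos}")
print("not in adapter:", ", ".join(other))
```

With these inputs, 14 of the 20 k-mers match the adapter exactly, at offsets consistent with the reported positions (for example `GATCGGA` at 1 and `CTCGTAT` at 44, inside the 40-44 bin). The remaining six (`AGTGAGT`, `GTGAGTA`, `TAATGGG`, `TGAGTAT`, `GAGTATA`, `CACATTA`) do not match, so their enrichment near the read start likely has a different source that this report alone does not identify.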