Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005202955 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1819561 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCT | 4629 | 0.2544020233451915 | Illumina Single End PCR Primer 1 (100% over 50bp) |
CGACTCTTAGCGGTGGATCACTCGGCTCGTGCGTCGATGAAGAACGCAGC | 4579 | 0.25165410777654607 | No Hit |
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 3065 | 0.16844722435796325 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTAAGAAATCTCGTATGC | 2646 | 0.14541969189271478 | TruSeq Adapter, Index 3 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TGATACG | 2135 | 0.0 | 41.72867 | 3 |
GATACGG | 2185 | 0.0 | 41.679863 | 4 |
ATACGGC | 2195 | 0.0 | 41.48998 | 5 |
TACGGCG | 2225 | 0.0 | 40.93056 | 6 |
GCGACCA | 2320 | 0.0 | 40.01307 | 10 |
ACGGCGA | 2320 | 0.0 | 39.91825 | 7 |
ACGACGC | 970 | 0.0 | 39.467396 | 41 |
ATGATAC | 2270 | 0.0 | 39.24701 | 2 |
GATCTAC | 2335 | 0.0 | 39.190773 | 21 |
CGGCGAC | 2380 | 0.0 | 39.004333 | 8 |
ACCACCG | 2405 | 0.0 | 38.598885 | 13 |
TACACGA | 1010 | 0.0 | 38.55785 | 38 |
CGACCAC | 2440 | 0.0 | 38.13537 | 11 |
CCACCGA | 2455 | 0.0 | 37.902363 | 14 |
CGACGCT | 985 | 0.0 | 37.522026 | 42 |
ACCGAGA | 2490 | 0.0 | 37.369595 | 16 |
GGCGACC | 2500 | 0.0 | 37.308105 | 9 |
CACGACG | 1035 | 0.0 | 37.201344 | 40 |
AGATCTA | 2490 | 0.0 | 36.751186 | 20 |
TCTACAC | 2495 | 0.0 | 36.589367 | 23 |