Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005202954 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 1809940 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
CGACTCTTAGCGGTGGATCACTCGGCTCGTGCGTCGATGAAGAACGCAGC | 4544 | 0.2510580461230759 | No Hit |
AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCT | 3097 | 0.1711106445517531 | Illumina Single End PCR Primer 1 (100% over 50bp) |
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 2929 | 0.16182856890283656 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACTAAGAAATCTCGTATGC | 2469 | 0.13641336176889843 | TruSeq Adapter, Index 3 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TGATACG | 1985 | 0.0 | 41.669876 | 3 |
TACGGCG | 2045 | 0.0 | 41.52301 | 6 |
ATACGGC | 2075 | 0.0 | 40.81666 | 5 |
GATACGG | 2080 | 0.0 | 40.718548 | 4 |
ACGGCGA | 2110 | 0.0 | 40.556644 | 7 |
CGACGCT | 790 | 0.0 | 38.427917 | 42 |
CGGCGAC | 2240 | 0.0 | 38.399326 | 8 |
ACACGAC | 840 | 0.0 | 38.23776 | 39 |
CCACCGA | 2230 | 0.0 | 38.176926 | 14 |
CACGACG | 820 | 0.0 | 38.097225 | 40 |
ACCACCG | 2220 | 0.0 | 38.051617 | 13 |
ATGATAC | 2175 | 0.0 | 38.02975 | 2 |
GCGACCA | 2240 | 0.0 | 38.006493 | 10 |
GATCTAC | 2195 | 0.0 | 37.584057 | 21 |
ACCGAGA | 2250 | 0.0 | 37.545296 | 16 |
TACACGC | 770 | 0.0 | 37.427208 | 25 |
AGATCTA | 2240 | 0.0 | 37.22186 | 20 |
ACGACGC | 840 | 0.0 | 37.19015 | 41 |
TCGTATG | 345 | 0.0 | 36.98314 | 43 |
TACACGA | 865 | 0.0 | 36.879307 | 38 |