Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004961671 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 41338606 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGTACGTAATCTCGTAT | 58384 | 0.14123359650782613 | TruSeq Adapter, Index 22 (97% over 40bp) |
CTCAGATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAACGGTAA | 46663 | 0.11287995536182328 | No Hit |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGTACGTAATCTCGTA | 44764 | 0.10828618652501248 | TruSeq Adapter, Index 22 (97% over 40bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCGTAT | 8265 | 0.0 | 34.291515 | 44 |
TTGAACG | 8340 | 0.0 | 26.27075 | 7 |
TCGAACG | 8715 | 0.0 | 24.55952 | 40 |
AGATCGG | 9035 | 0.0 | 23.857437 | 1 |
AACGCTG | 9440 | 0.0 | 23.002125 | 10 |
CGTACGT | 12375 | 0.0 | 22.807209 | 34 |
GAACGCT | 9685 | 0.0 | 22.602404 | 9 |
TACGTAA | 12620 | 0.0 | 22.5037 | 36 |
ACGGTAA | 9620 | 0.0 | 22.233347 | 44 |
GTCGAAC | 9950 | 0.0 | 21.555395 | 39 |
ACCGTAC | 13120 | 0.0 | 21.547104 | 32 |
ACGTAAT | 13015 | 0.0 | 21.431973 | 37 |
CGTAATC | 12935 | 0.0 | 21.360443 | 38 |
GTACGTA | 13360 | 0.0 | 21.290169 | 35 |
AACGGTA | 10090 | 0.0 | 21.169054 | 43 |
GAACGGT | 10375 | 0.0 | 20.693556 | 42 |
AGTCGAA | 10510 | 0.0 | 20.595837 | 38 |
CGAACGG | 10325 | 0.0 | 20.580715 | 41 |
GCCTAAC | 10535 | 0.0 | 20.505943 | 24 |
TGAACGC | 10850 | 0.0 | 20.37584 | 8 |