Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005432027 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 67850527 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGTACTCCGTGTAGATCTCGGTGGTCGCCGTATCATTAAAAAAA | 569680 | 0.83961028040357 | Illumina Single End PCR Primer 1 (96% over 33bp) |
GATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGTACTCCGTGTAGATCTCGGTGGTCGCCGTATCATTAAAAAA | 217127 | 0.32000783133195115 | Illumina Single End PCR Primer 1 (97% over 34bp) |
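
The Percentage column above appears to be each sequence's Count divided by the Total Sequences value from Basic Statistics. The following is a minimal sketch of that arithmetic, assuming that relationship holds; the helper name `percent_of_total` is hypothetical and not part of FastQC.

```python
# "Total Sequences" as reported under Basic Statistics.
TOTAL_SEQUENCES = 67850527


def percent_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of total (assumed meaning of the Percentage column)."""
    return 100.0 * count / total


# Counts copied from the two rows of the Overrepresented sequences table.
counts = [569680, 217127]
percentages = [percent_of_total(c) for c in counts]

for count, pct in zip(counts, percentages):
    print(f"{count} reads -> {pct:.6f}% of all reads")
```

Under this assumption, the two adapter-like sequences together account for a little over 1% of the reads.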
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGCCGTA | 84020 | 0.0 | 47.664207 | 57 |
GCCGTAT | 85085 | 0.0 | 47.07165 | 58 |
CGTATCA | 85735 | 0.0 | 46.734383 | 60 |
CCGTGTA | 88290 | 0.0 | 45.50695 | 38 |
GTAGATC | 88695 | 0.0 | 45.275913 | 42 |
CCGTATC | 88830 | 0.0 | 45.226982 | 59 |
GTAGTAC | 89845 | 0.0 | 44.67355 | 30 |
GCGTCGT | 90805 | 0.0 | 44.341347 | 10 |
TCGCCGT | 91110 | 0.0 | 44.000515 | 56 |
TGGTCGC | 91375 | 0.0 | 43.905937 | 53 |
TCGTGTA | 91700 | 0.0 | 43.8594 | 13 |
TCTCGGT | 91960 | 0.0 | 43.77171 | 47 |
GGTCGCC | 92005 | 0.0 | 43.676544 | 54 |
TAGATCT | 92455 | 0.0 | 43.473083 | 43 |
GTGTAGT | 92415 | 0.0 | 43.442413 | 28 |
GTCGTGT | 93165 | 0.0 | 43.20305 | 12 |
TAGTACT | 93740 | 0.0 | 43.03665 | 31 |
GTATCAT | 93525 | 0.0 | 42.959778 | 61 |
ACTCCGT | 93750 | 0.0 | 42.952305 | 35 |
GAGTGTA | 93400 | 0.0 | 42.917778 | 26 |
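
The enriched 7-mers listed above look like fragments of the first overrepresented sequence. The sketch below checks this for a few rows, assuming that Max Obs/Exp Position is the 1-based start offset of the k-mer within the read; that interpretation, and the helper name `one_based_offset`, are assumptions made for illustration.

```python
# First sequence from the Overrepresented sequences table.
READ = (
    "ATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGTACTCCGTGTAGATCTCGGTGGTCGCCGTATCATTAAAAAAA"
)

# k-mer -> reported "Max Obs/Exp Position", copied from the Kmer Content table.
KMERS = {
    "CGCCGTA": 57,
    "GCCGTAT": 58,
    "CGTATCA": 60,
    "CCGTGTA": 38,
    "GTAGATC": 42,
    "GTAGTAC": 30,
    "GCGTCGT": 10,
    "TCGTGTA": 13,
}


def one_based_offset(kmer, read=READ):
    """Return the 1-based start of the first occurrence of kmer in read, or None."""
    idx = read.find(kmer)
    return idx + 1 if idx >= 0 else None


offsets = {kmer: one_based_offset(kmer) for kmer in KMERS}

for kmer, reported in KMERS.items():
    print(f"{kmer}: reported position {reported}, offset in read {offsets[kmer]}")
```

If the offsets agree with the reported positions, the k-mer enrichment is most likely another view of the same adapter-derived reads rather than a separate issue.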