Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00003249844 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 317275812 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACAGCGATAGATCTCGTATG | 577105 | 0.18189379025212296 | TruSeq Adapter, Index 1 (97% over 35bp) |
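
The Percentage value above can be sanity-checked against the Basic Statistics table. The sketch below assumes Percentage is Count divided by Total Sequences, times 100. That definition is an assumption that happens to reproduce the reported figure, and the variable names are illustrative only.

```python
# Values copied from the report tables above.
total_sequences = 317275812   # "Total Sequences"
adapter_count = 577105        # "Count" for the overrepresented sequence

# Assumption: Percentage = Count / Total Sequences * 100.
percentage = adapter_count / total_sequences * 100

print(f"{percentage:.4f}%")
```

Under that assumption, roughly 0.18% of reads match this sequence exactly.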
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CGGAAGA | 724255 | 0.0 | 12.659012 | 3 |
TCGGAAG | 724855 | 0.0 | 12.636113 | 2 |
ATCGGAA | 729365 | 0.0 | 12.520481 | 1 |
AGAGCAC | 813430 | 0.0 | 11.451467 | 7 |
AGCACAC | 822585 | 0.0 | 11.332839 | 9 |
GAGCACA | 843590 | 0.0 | 11.122789 | 8 |
GAAGAGC | 893685 | 0.0 | 10.499309 | 5 |
TATGCCG | 335095 | 0.0 | 10.180433 | 3 |
ATGCCGT | 340460 | 0.0 | 10.068542 | 4 |
GCCGTCT | 348920 | 0.0 | 9.886617 | 6 |
TGCCGTC | 350155 | 0.0 | 9.818639 | 5 |
CGTCTTC | 369400 | 0.0 | 9.448333 | 8 |
AAGAGCA | 1047000 | 0.0 | 9.114807 | 6 |
CCGTCTT | 400985 | 0.0 | 8.718556 | 7 |
GGAAGAG | 1121550 | 0.0 | 8.562647 | 4 |
GTATGCC | 432500 | 0.0 | 8.000274 | 2 |
CGTATGC | 358630 | 0.0 | 7.245053 | 1 |
CTCGTAT | 314105 | 0.0 | 5.944401 | 40-44 |
TCTCGTA | 319570 | 0.0 | 5.8817697 | 40-44 |
GTCTTCT | 691410 | 0.0 | 5.5185013 | 9 |
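
Several of the enriched 7-mers look like fragments of the overrepresented sequence listed earlier. A minimal sketch of how one might check this: it looks up a hand-picked subset of the k-mers in that 50-base sequence and reports a 1-based offset. The helper name `offset_in` and the choice of subset are illustrative, not part of the report.

```python
# Overrepresented sequence copied from the report.
adapter_hit = "ATCGGAAGAGCACACGTCTGAACTCCAGTCACAGCGATAGATCTCGTATG"

# A subset of the 7-mers from the Kmer Content table (chosen for illustration).
kmers = ["CGGAAGA", "TCGGAAG", "ATCGGAA", "AGAGCAC",
         "TATGCCG", "CTCGTAT", "TCTCGTA"]

def offset_in(kmer, sequence):
    """Return the 1-based offset of the first match of kmer, or None if absent."""
    idx = sequence.find(kmer)
    return idx + 1 if idx >= 0 else None

offsets = {kmer: offset_in(kmer, adapter_hit) for kmer in kmers}

for kmer, pos in offsets.items():
    print(kmer, pos)
```

For the k-mers that are found, the offsets (1, 2, 3, 7, 42, 43) line up with the Max Obs/Exp Position column, including the 40-44 bin. This is consistent with those k-mers coming from reads that begin with this sequence, although the check alone does not establish that. TATGCCG does not occur in the reported 50 bases, so its enrichment is not explained by this comparison.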