Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00004973608 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 458886177 |
Sequences flagged as poor quality | 0 |
Sequence length | 151 |
%GC | 41 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACCTGAAGCTATCTCGTATG | 656637 | 0.14309365435516266 | TruSeq Adapter, Index 19 (97% over 37bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TCGGAAG | 602700 | 0.0 | 19.91037 | 2 |
ATCGGAA | 604295 | 0.0 | 19.820194 | 1 |
CGGAAGA | 610890 | 0.0 | 19.66123 | 3 |
AGAGCAC | 819905 | 0.0 | 14.901783 | 7 |
AGCACAC | 833480 | 0.0 | 14.688643 | 9 |
TATGCCG | 235510 | 0.0 | 14.621857 | 45-49 |
GAGCACA | 855615 | 0.0 | 14.348458 | 8 |
GAAGAGC | 901975 | 0.0 | 13.670539 | 5 |
CGTATGC | 254375 | 0.0 | 13.449124 | 45-49 |
ATGCCGT | 264570 | 0.0 | 13.039924 | 45-49 |
GCCGTCT | 266530 | 0.0 | 12.785214 | 50-54 |
CTCGTAT | 244870 | 0.0 | 12.762384 | 40-44 |
TCTCGTA | 254080 | 0.0 | 12.326587 | 40-44 |
CGTCTTC | 288805 | 0.0 | 12.014465 | 50-54 |
CCGTCTT | 297770 | 0.0 | 11.661508 | 50-54 |
TATCTCG | 269855 | 0.0 | 11.499091 | 40-44 |
ATCTCGT | 282080 | 0.0 | 11.145164 | 40-44 |
AAGAGCA | 1144335 | 0.0 | 10.971591 | 6 |
GATCGGA | 510115 | 0.0 | 10.311455 | 1 |
GGAAGAG | 1262030 | 0.0 | 10.02995 | 4 |