Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005329752 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 831907 |
Sequences flagged as poor quality | 0 |
Sequence length | 49-150 |
%GC | 38 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCAGCATAATCTCGTAT | 885 | 0.10638208357424568 | TruSeq Adapter, Index 5 (97% over 37bp) |
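
The Percentage column above appears to be the sequence count expressed as a share of Total Sequences from Basic Statistics. A minimal sketch of that arithmetic, assuming that definition holds (the variable names are illustrative, not FastQC identifiers):

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
total_sequences = 831907   # "Total Sequences"
count = 885                # "Count" for the TruSeq Adapter, Index 5 hit

# Assumption: Percentage = Count / Total Sequences * 100.
percentage = count / total_sequences * 100

print(f"{percentage:.4f}%")
```

Under this assumption the result agrees with the reported 0.10638208357424568, i.e. roughly one read in a thousand begins with this adapter sequence.
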
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
GGGGGGG | 21790 | 0.0 | 12.495727 | 140-144 |
GACACCC | 380 | 6.215213E-5 | 11.006624 | 140-144 |
AGGGGGG | 905 | 4.3655746E-11 | 9.705289 | 140-144 |
AAGGGGG | 955 | 1.2241799E-9 | 8.759198 | 140-144 |
CGGAAGA | 1590 | 0.0 | 8.483502 | 4 |
TCGTATG | 1070 | 1.1878001E-9 | 8.208679 | 140-144 |
TCGGAAG | 1610 | 3.6379788E-12 | 8.067817 | 3 |
ATCGGAA | 1630 | 5.456968E-12 | 7.969304 | 2 |
CGTATGC | 1110 | 2.4301698E-9 | 7.9128704 | 140-144 |
GATCGGA | 1655 | 7.2759576E-12 | 7.8522267 | 1 |
GAGCACA | 1655 | 7.2759576E-12 | 7.847979 | 9 |
CCTTGGC | 595 | 0.008769574 | 7.5603695 | 1 |
AGAGCAC | 1725 | 1.8189894E-11 | 7.5295095 | 8 |
CTCGTAT | 1065 | 7.8214725E-8 | 7.4617686 | 140-144 |
TCTCGTA | 1050 | 4.8904803E-7 | 7.1700296 | 140-144 |
ATGCCGT | 1015 | 8.71114E-9 | 6.994162 | 130-134 |
GAAGAGC | 1855 | 7.348717E-10 | 6.733343 | 6 |
GCCGTCT | 975 | 1.9874278E-8 | 6.710995 | 125-129 |
CGTCTTC | 1045 | 1.727858E-7 | 6.705325 | 135-139 |
CCGTCTT | 1035 | 4.315698E-7 | 6.431606 | 135-139 |
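
Several k-mers in the table above with a single-base Max Obs/Exp Position look like fragments of the overrepresented TruSeq adapter sequence. The sketch below checks this by locating each of those k-mers in the 50 bp overrepresented sequence and comparing the 1-based offset with the reported position. It is an illustrative cross-check on the values in this report, not part of FastQC itself, and the helper name `offset_in` is hypothetical.

```python
# Overrepresented sequence copied from the report.
overrepresented = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCAGCATAATCTCGTAT"

# K-mers whose "Max Obs/Exp Position" is a single base, with that position.
reported = {
    "GATCGGA": 1,
    "ATCGGAA": 2,
    "TCGGAAG": 3,
    "CGGAAGA": 4,
    "GAAGAGC": 6,
    "AGAGCAC": 8,
    "GAGCACA": 9,
    "CCTTGGC": 1,
}

def offset_in(seq, kmer):
    """Return the 1-based offset of the first occurrence of kmer in seq, or None."""
    i = seq.find(kmer)
    return i + 1 if i >= 0 else None

matches = {kmer: offset_in(overrepresented, kmer) for kmer in reported}

for kmer, position in reported.items():
    found = matches[kmer]
    if found is None:
        print(f"{kmer}: not in the adapter sequence (reported position {position})")
    else:
        print(f"{kmer}: adapter offset {found}, reported position {position}")
```

Seven of the eight k-mers sit in the adapter at exactly the reported read position, which is consistent with reads that start directly with adapter sequence. CCTTGGC does not occur in the adapter, so its enrichment at position 1 presumably has a different origin.
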