Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005445222 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20547161 |
Sequences flagged as poor quality | 0 |
Sequence length | 85 |
%GC | 52 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG | 582990 | 2.837326285611915 | No Hit |
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG | 511190 | 2.4878862826840162 | No Hit |
TGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG | 138770 | 0.6753731087229033 | No Hit |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGATCAGATCTCGTATGC | 92181 | 0.4486313218648552 | TruSeq Adapter, Index 9 (100% over 50bp) |
TTTTTGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG | 68549 | 0.33361786574797364 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGGGGGG | 81925 | 0.0 | 56.803455 | 1 |
TCGTATG | 11340 | 0.0 | 36.29502 | 42-43 |
TATGCCG | 11660 | 0.0 | 35.281605 | 46-47 |
CGTATGC | 12700 | 0.0 | 32.454845 | 44-45 |
CTCGTAT | 12340 | 0.0 | 32.32947 | 42-43 |
GTATGCC | 14035 | 0.0 | 29.480337 | 44-45 |
TTTTGGG | 21890 | 0.0 | 29.159498 | 2 |
GTCACGA | 14315 | 0.0 | 29.07103 | 28-29 |
ACGATCA | 14555 | 0.0 | 28.77004 | 32-33 |
AGTCACG | 15115 | 0.0 | 28.028921 | 28-29 |
GCCGTCT | 14900 | 0.0 | 27.768421 | 48-49 |
CACGATC | 15125 | 0.0 | 27.568281 | 30-31 |
TTTGGGG | 24870 | 0.0 | 26.237083 | 3 |
ATGCCGT | 16080 | 0.0 | 26.050272 | 46-47 |
TCACGAT | 16265 | 0.0 | 25.976076 | 30-31 |
CGATCAG | 17020 | 0.0 | 24.510445 | 32-33 |
GATCTCG | 16495 | 0.0 | 24.29441 | 38-39 |
TGCCGTC | 17485 | 0.0 | 24.148794 | 48-49 |
GAGCACA | 35030 | 0.0 | 24.095982 | 9 |
CCGTCTT | 17380 | 0.0 | 23.828793 | 50-51 |