Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005432048 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 69028972 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 50 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
ATCGGAAGAGCACACGTCTGAACTCCAGTCACGGTACCTTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAA | 1324684 | 1.9190261155852069 | TruSeq Adapter, Index 22 (97% over 37bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACGGTACCTTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA | 321583 | 0.46586670883639986 | TruSeq Adapter, Index 22 (97% over 38bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TATGCCG | 177945 | 0.0 | 53.83951 | 47 |
TCGTATG | 180490 | 0.0 | 53.17836 | 44 |
TACCTTA | 181235 | 0.0 | 52.879387 | 35 |
TATCTCG | 179875 | 0.0 | 52.558994 | 40 |
CGTATGC | 182985 | 0.0 | 52.47058 | 45 |
ACGGTAC | 184070 | 0.0 | 52.15079 | 31 |
CTCGTAT | 183880 | 0.0 | 52.14532 | 43 |
CGGTACC | 184770 | 0.0 | 51.95135 | 32 |
AGTCACG | 186665 | 0.0 | 51.34799 | 27 |
ATGCCGT | 187820 | 0.0 | 50.973934 | 48 |
GTATGCC | 188650 | 0.0 | 50.84801 | 46 |
TCTCGTA | 186060 | 0.0 | 50.83342 | 42 |
ACCTTAT | 186335 | 0.0 | 50.78158 | 36 |
GCCGTCT | 190215 | 0.0 | 50.288593 | 50 |
CACGGTA | 191105 | 0.0 | 50.28155 | 30 |
ACACGTC | 191510 | 0.0 | 50.179184 | 12 |
ACGTCTG | 191630 | 0.0 | 50.132633 | 14 |
CGTCTGA | 192030 | 0.0 | 49.995686 | 15 |
TGCCGTC | 191690 | 0.0 | 49.97542 | 49 |
GTCACGG | 192655 | 0.0 | 49.828484 | 28 |