Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005431824 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 142170280 |
Sequences flagged as poor quality | 0 |
Sequence length | 75 |
%GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAA | 671113 | 0.47204872917180724 | TruSeq Adapter, Index 2 (100% over 63bp) |
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCGATGTATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAAAA | 319560 | 0.2247727162104485 | TruSeq Adapter, Index 2 (100% over 63bp) |
CTCAGATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAACGGTAACAGGAAGAAGCTTGCTTCTTTGCTG | 178158 | 0.12531311044755628 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
AGATCGG | 107165 | 0.0 | 51.866367 | 1 |
TCGTATG | 135745 | 0.0 | 40.531803 | 44 |
CTCGTAT | 135325 | 0.0 | 40.364475 | 43 |
CGTATGC | 138190 | 0.0 | 39.842117 | 45 |
TATGCCG | 137985 | 0.0 | 39.764683 | 47 |
TATCTCG | 133755 | 0.0 | 39.27539 | 40 |
ACACGTC | 146400 | 0.0 | 37.91457 | 14 |
GATCGGA | 147115 | 0.0 | 37.894714 | 2 |
ACCGATG | 145540 | 0.0 | 37.764256 | 33 |
ATCTCGT | 140410 | 0.0 | 37.48272 | 41 |
CGATGTA | 148875 | 0.0 | 36.844532 | 35 |
ACGTCTG | 151680 | 0.0 | 36.584843 | 16 |
CACCGAT | 151440 | 0.0 | 36.286263 | 32 |
TTGAACG | 59520 | 0.0 | 36.120728 | 7 |
TCACCGA | 155080 | 0.0 | 35.60278 | 31 |
CCGATGT | 155210 | 0.0 | 35.39043 | 34 |
ATGCCGT | 154355 | 0.0 | 35.382206 | 48 |
CACGTCT | 158320 | 0.0 | 35.08972 | 15 |
GTATGCC | 158090 | 0.0 | 34.863758 | 46 |
GTCACCG | 159135 | 0.0 | 34.700344 | 30 |