Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005726216 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 29539137 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTAT | 147353 | 0.4988398950179215 | TruSeq Adapter, Index 16 (97% over 40bp) |
AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTA | 140198 | 0.4746177926592777 | TruSeq Adapter, Index 16 (97% over 40bp) |
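
The Percentage column appears to be each sequence's count divided by Total Sequences (29539137), times 100. The sketch below re-derives it from the values in this report, under that assumption. It also checks that the second sequence is the first one shifted by a single base, which would suggest both rows stem from the same adapter read-through. The names `percentage` and `shifted_by_one` are illustrative and not part of FastQC.

```python
# Values copied from this report; the formula is an assumption checked against them.
TOTAL_SEQUENCES = 29539137  # "Total Sequences" in Basic Statistics

# (sequence, count, percentage as reported)
overrepresented = [
    ("GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTAT", 147353, 0.4988398950179215),
    ("AGATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTA", 140198, 0.4746177926592777),
]


def percentage(count, total):
    """Share of all reads, in percent (hypothetical helper)."""
    return 100.0 * count / total


for seq, count, reported in overrepresented:
    computed = percentage(count, TOTAL_SEQUENCES)
    print(f"{seq[:12]}...  computed={computed:.6f}  reported={reported:.6f}")

# The second sequence minus its leading base equals the first minus its trailing base.
first, second = overrepresented[0][0], overrepresented[1][0]
shifted_by_one = second[1:] == first[:-1]
print("one-base shift of the same adapter read:", shifted_by_one)

# Combined share of reads occupied by the two adapter sequences.
combined = percentage(sum(c for _, c, _ in overrepresented), TOTAL_SEQUENCES)
print(f"combined share: {combined:.4f}%")
```

Together the two rows account for just under 1% of reads, so adapter trimming before alignment is probably worth considering.
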
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTCGTAT | 17630 | 0.0 | 40.334637 | 44 |
AGATCGG | 21315 | 0.0 | 32.78399 | 1 |
CGTATGC | 3035 | 0.0 | 26.164097 | 1 |
TATGCCG | 3235 | 0.0 | 24.208544 | 3 |
ATACGAG | 2415 | 0.0 | 23.044481 | 16 |
GCATACG | 2545 | 0.0 | 21.954086 | 14 |
TCTCGTA | 33525 | 0.0 | 20.510923 | 43 |
CCGATCT | 33580 | 0.0 | 20.478022 | 39 |
CCCGATC | 33690 | 0.0 | 20.417688 | 38 |
CATACGA | 2760 | 0.0 | 20.403296 | 15 |
GTCCCGA | 35515 | 0.0 | 20.287565 | 36 |
TCCCGAT | 34130 | 0.0 | 20.285742 | 37 |
CGTCCCG | 35530 | 0.0 | 20.285192 | 35 |
GATCGGA | 36360 | 0.0 | 20.256039 | 1 |
ACCCGTC | 36380 | 0.0 | 20.17972 | 32 |
CGGCATA | 2825 | 0.0 | 20.012009 | 12 |
ATCTCGT | 34530 | 0.0 | 19.824661 | 42 |
ACACGTC | 36940 | 0.0 | 19.812166 | 13 |
CACCCGT | 37320 | 0.0 | 19.65399 | 31 |
ACGTCTG | 37530 | 0.0 | 19.459377 | 15 |
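
Many of the enriched k-mers look like fragments of the TruSeq adapter read listed under Overrepresented sequences. The sketch below locates a few of them in that 50 bp sequence and compares the offset with the reported Max Obs/Exp Position. It assumes positions are 1-based and not grouped into bins. `kmer_position` is a hypothetical helper, and only a subset of the rows is included.

```python
# First overrepresented sequence from this report (TruSeq Adapter, Index 16).
ADAPTER_READ = "GATCGGAAGAGCACACGTCTGAACTCCAGTCACCCGTCCCGATCTCGTAT"

# k-mer -> reported "Max Obs/Exp Position" (subset of the Kmer Content rows).
reported = {
    "CTCGTAT": 44,
    "GATCGGA": 1,
    "ACACGTC": 13,
    "ACGTCTG": 15,
    "CACCCGT": 31,
    "CGTATGC": 1,  # does not occur in the 50 bp adapter read itself
}


def kmer_position(kmer, read):
    """1-based offset of the first occurrence of kmer in read, or None."""
    idx = read.find(kmer)
    return idx + 1 if idx >= 0 else None


for kmer, pos in reported.items():
    found = kmer_position(kmer, ADAPTER_READ)
    if found is None:
        status = "not in adapter read"
    elif found == pos:
        status = "matches reported position"
    else:
        status = f"found at {found}, reported {pos}"
    print(f"{kmer}  {status}")
```

K-mers such as CGTATGC, with counts roughly ten times lower, are absent from this read and presumably have a different origin; this sketch does not attempt to identify it.
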