Basic Statistics
Measure | Value |
---|---|
Filename | EGAF00005202919 |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 7599799 |
Sequences flagged as poor quality | 0 |
Sequence length | 50 |
%GC | 49 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
GATCGGAAGAGCACACGTCTGAACTCCAGTCACCTATACATCTCGTATGC | 30211 | 0.3975236713497291 | TruSeq Adapter, Index 2 (97% over 37bp) |
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA | 12704 | 0.1671623157401926 | No Hit |
CGACTCTTAGCGGTGGATCACTCGGCTCGTGCGTCGATGAAGAACGCAGC | 12300 | 0.16184638567414744 | No Hit |
AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCT | 12225 | 0.16085951746881727 | Illumina Single End PCR Primer 1 (100% over 50bp) |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
TGATACG | 6645 | 0.0 | 41.579163 | 3 |
TACGGCG | 6780 | 0.0 | 41.367714 | 6 |
ATACGGC | 6825 | 0.0 | 41.22389 | 5 |
CGTATGC | 3860 | 0.0 | 40.520496 | 44 |
TCGTATG | 3885 | 0.0 | 40.202065 | 43 |
GATACGG | 6995 | 0.0 | 40.127674 | 4 |
ACGGCGA | 7180 | 0.0 | 39.58395 | 7 |
CTCGTAT | 4000 | 0.0 | 39.157276 | 42 |
GCGACCA | 7425 | 0.0 | 38.248184 | 10 |
GATCGGA | 4295 | 0.0 | 37.706093 | 1 |
GATCTAC | 7420 | 0.0 | 37.652367 | 21 |
TCTCGTA | 4210 | 0.0 | 37.418 | 41 |
CCACCGA | 7660 | 0.0 | 37.04606 | 14 |
CGGCGAC | 7690 | 0.0 | 36.90154 | 8 |
CGACCAC | 7690 | 0.0 | 36.844326 | 11 |
GGCGACC | 7725 | 0.0 | 36.734344 | 9 |
ATCGGAA | 4400 | 0.0 | 36.646484 | 2 |
ACCACCG | 7750 | 0.0 | 36.53069 | 13 |
ACGACGC | 3085 | 0.0 | 36.51442 | 41 |
ACCGAGA | 7760 | 0.0 | 36.511963 | 16 |