Source Information for ens_35_segdup_sanger

Description

Segmental duplication dataset compiled by seeking homologies within the reference sequence using SSAHA2 and SEGMENT (unpublished) software.

The data were kindly provided by Nikolai Ivanov and Tony Cox at The Wellcome Trust Sanger Institute.

A region was considered a low copy repeat (LCR) if it had at least 90% identity to a different region of at least 3kb of the same chromosome.

Reference:
Ning Z, Cox AJ, Mullikin JC. (2001) SSAHA: a fast search method for large DNA databases. Genome Res. 2001 Oct;11(10):1725-9.

Key

Orientation
   Red: Pair members in direct orientation
   Green: Pair members in inverted orientation
   Blue: Pair members on different chromosomes
     
% Identity
   Over 99% identity
   97 - 99% identity
   95 - 97% identity
   Less than 95% identity
     
Spacing
   Less than 100Kb apart
   100Kb - 1Mb apart
   1 - 5Mb apart
   5 - 10Mb apart
   More than 10Mb apart