Brzoska et al. sequences vs. Gencode


The dataset consists in 848 sequences coming from RTPCR/RACE experiments (article, fasta file) that were mapped onto hg17 using blat (min ID 95%, max. intron size 5Mb).
Only the best hit was kept for each sequence, and unspliced matches (i.e. longest intron of the hit < 33nts) present in this set were then filtered out.

9 of these hits overlap encode regions (psl file). Those 9 blat hits are referred to as 'User supplied track' in the UCSC browser screenshots below:






























Contact: Julien Lagarde (jlagardeatimim.es)