ABS: a database of Annotated regulatory Binding Sites from orthologous promoters v 1.0

Genome BioInformatics Research Lab

Resources & Datasets | Gene Predictions | Seminars & Courses

IMIM

UPF

CRG

GRIB

Resources & Datasets

ABS

2.4. CONSTRUCTION OF BENCHMARKS

Description:
This is a web tool to create artificial data sets for testing pattern discovery programs. Artificial sequences are generated according to a certain nucleotide distribution with a set of embedded motifs from the ABS database. The motifs are planted in a subset of similar positions to simulate the existance of related motifs in a set of orthologous or co-regulated genes from a microarray experiment. A graphical representation of the produced data set is also displayed.

Parameters:
- Length: size of each artificial sequence (bp)
- Number of sequences: size of the data set
- Number of planted motifs: motifs included on each sequence
- Species: a specific data set to detect motifs from this species will be produced
- Nucleotide distribution: A,C,G,T content of the random regions between planted motifs
- Probability to plant a motif: probability to insert a motif in a sequence
- TF name: planted motifs are associated to these TFs

Example:
This is an artificial data set produced by this application.

Number of Sequences Number of Planted Motifs

Length (nucleotides) Probability to plant a motif

Species Background composition

A C G T

TF name (multiple choice)

HINTS:
1. More than one TF can be selected (multiple choice is allowed).
2. Use Shift to select groups of consecutive TFs
3. Use Ctrl to select groups of non consecutive TFs

Disclaimer

webmaster

AML
AP1
AP2
AP2A
CAAT
CEBP
CETS
CJUN
CMYC
COUP
CREB
CUTL
E2
E2F
E2F1
E47
EGR1
ETS1
FXR
GATA3
GATA6
GC
GR
HIF1
HLTF
HMGI
HNF1
HNF3
HNF3B
HNF4
HNF6
HOX
HP1
HSF1
IPF1
IRF
M1
MAZ
MEF2
MEF3
MITF
MTF1
MYB
MYF
MYOD
MZF1
NF1
NFAT
NFKB
NFY
NKX2
NRL
OCT
OLF1
P53
POU1F1
POU2F1
RAR
RFX1
RXR
SMAD
SP1
SRF
STAT3
TBP
TEF1
USF
VDR