|
- AmadeusPBM v1.0
NEW:
AmadeusPBM v1.0
with Allegro v1.0 and Amadeus v1.2 (12 Jul 2011)
Follow this link to download the files necessary to execute AmadeusPBM/Allegro/Amadeus.
You will also need to download regulatory sequences for the organism(s) you
are analyzing or PBM file in case you are using AmadeusPBM - see below.
This version includes Amadeus, Allegro and AmadeusPBM. AmadeusPBM allows reconstruction
of binding site motifs from protein binding microarray (PBM) data.
Amadeus PBM is built on top of Allegro and has the same operating systems and hardware
specifications. Additional hardware and software requirements, as well as detailed
instructions on how to execute the program, are found in README.txt file in the
download package in the overview page.
An example of a PBM dataset is available below (under "PBM data"). More PBM files
can be downloaded from
http://the_brain.bwh.harvard.edu/uniprobe/.
The software gets a PBM filename, scoring scheme to choose, k as the length of the
k-mer to work on, and how many k-mers to treat as a target set. The output is a
binding site in PWM logo and a list of similar PWMs from transfac.
Amadeus/Allegro is a Java application.
It requires Java 1.5 and at least 1GB RAM memory.
Allegro should run properly on Windows (2000/XP/Vista) and
Linux (Debian). It was not tested on Mac OS.
Additional hardware and software requirements, as well as detailed
instructions on how to execute the program, are found in the README.txt file
in the download package and in the
overview page.
The AmadeusPBM/Allegro/Amadeus software is freely available for academic use.
It is also available for non-academic use under appropriate licensing.
Please contact Ron Shamir (rshamir AT post.tau.ac.il) or
Chaim Linhart (chaiml AT post.tau.ac.il) for further information.
- Regulatory sequences
Please extract the zip files into the Allegro/Amadeus installation directory.
Each zip file contains a file with promoter/3'-UTR sequences,
and a corresponding background-set file.
After unzipping, the files should eventually reside
under [installation_dir]/Allegro_v1.0/data/sequences/[organism]/.
All sequences are repeat masked, unless indicated otherwise.
Promoters *
| Organism |
Source |
Version |
# genes |
File size |
| Human |
Ensembl |
Mar '09 (53) NEW |
19,336 |
12.2 MB |
| Chimpanzee |
Ensembl |
Dec '06 (42) |
25,481 |
25.4 MB |
| Dog |
Ensembl |
Jun '07 (45) |
19,573 |
20.8 MB |
| Mouse |
Ensembl |
Mar '09 (53) NEW |
19,627 |
13.5 MB |
| Rat |
Ensembl |
Oct '07 (47) |
23,385 |
26.2 MB |
| Chicken |
Ensembl |
Sep '09 (56) NEW |
16,730 |
12.1 MB |
Chicken.masked_promoters.Ensembl_v56.zip
| Zebrafish |
Ensembl |
Mar '09 (53) NEW |
21,048 |
13.1 MB |
| Fruitfly |
Ensembl |
Mar '09 (53) NEW |
14,136 |
8.9 MB |
| C. elegans |
Ensembl |
Apr '07 (44) |
19,656 |
18.2 MB |
| S. cerevisiae |
SGD |
Dec '06 (1.01) |
6,532 |
2.1 MB |
| S. pombe ** |
GeneDB |
Oct '08 NEW |
4,853 |
0.8 MB |
| Arabidopsis |
TAIR |
Feb '08 |
33,282 |
25.3 MB |
* Sequences extracted from the Ensembl project include 3 Kb upstream of the TSS,
and the first two exons and introns.
As of release 50, promoter sequences were downloaded
via BioMart, and include 3 Kb upstream
and the 5' utr.
** Promoters of S. pombe are not repeat-masked.
* As of release 52 of Ensembl, 3' UTR sequences were downloaded
via BioMart.
For genes with more than one sequence, the longest 3' UTR was selected.
|