PLEASE SEE THE MANUAL PDF VCFtoTreeManual_V3.0.0_Oct.pdf
IN THIS FOLDER FOR DETAILED INSTRUCTIONS!!! -- Credited to Izzy Starr
Brief installations and instructions are under V1.0.0 (please scroll down)
Please make sure you have installed all the necessary tools as described before using the VCFtoTree.
Updates:
-
It can now be applied to other species besides human.
-
In human section, you can also add in your customized vcf files, and you can specify the number of individuals in your vcf file.
-
Users can choose whether to build the tree or not, also to build the tree using either FastTree or RAxML. (To note that, VCFtoTree v3.0.0 compiles the FastTree with
-DUSE_DOUBLE
by default, lifting the hard-coded limit on branch length precision. Please see detail in this blog.) -
!!!!!!!!!Vindija Neanderthal can NOT be used!!!!!!!!! in this version right now due to the fact that the data haven't been published. It will be updated once it is published.
-
To use the functioal of "other species" and "customized human", if your customized vcf file or reference are from a local address, please ente 75EB r the full address of the file.
-
For customized vcf file and reference file, please use the compressed version of them as in ".fa.gz" or ".vcf.gz", the program will index them for you.
-
To enter the URL of your customized file, please use the URL begins with "http://".
https://www.dropbox.com/sh/sewjxmh9207fp89/AAASfUGqMBtd2_HO3CR7XA3ha?dl=0
Updates:
- Now users can choose the populations that they are interested, or "ALL" to include all the populations.
- Added the new Neanderthal Vindija to the GUI.
https://www.dropbox.com/sh/sewjxmh9207fp89/AAASfUGqMBtd2_HO3CR7XA3ha?dl=0
Build phylogeny from 1000 Genomes Phase 3 VCF file for 2504 individuals, also including ancient (Neanderthal, Denisovan) and primate genomes.
VCF --> fasta sequence (whole sequence, not just variable sites) --> Phylogeny
We provide a user friendly interface, in which you can choose the genomes you want to add into the phylogeny.
Raw scripts can be found in Code folder.
VCFtoTree was designed for Mac OS users, has been tested on El Captian V10.11.5 with python 2.7.11
Here are the programs/packages needed to run the VCFtoTree.
- For first time users, you need to make sure your Mac has install the command line tools from Apple.com
- samtools executable file needs to be installed in your /usr/local/bin
- tabix needs to be installed in your /usr/local/bin
- wget needs to be installed in your /usr/local/bin
- gcc needs to be installed in your /usr/bin
The easiest way to check if you have those tools, type the commands below into your Terminal:
find /usr/local/bin/tabix
find /usr/local/bin/samtools
find /usr/local/bin/wget
find /usr/bin/gcc
After making sure that you have the above three tools installed, you can download and use the app now.
The running time of the app mostly depends on 1) downloading the Neanderthal vcf file; 2) building the phylogeny using RAxML.
After the whole process finishing, the app will generate a VCFtoTree_Output folder right next to your application in which you can find the newick tree file and all other output files.
The folder will be empty before the app finishes. After the app finishing, the folder will be open automatically, you will be able to see all the files generated.
After each run, please change the previous output folder name to something else to avoid overwrite.
On the screen there will also be message saying that "Your tree is completed."
Your tree file is the file with ".newick" extention.
https://www.dropbox.com/sh/sewjxmh9207fp89/AAASfUGqMBtd2_HO3CR7XA3ha?dl=0
The first working version of VCFtoTree.app.