institute of biotechnology >> brc >> bioinformatics >> internal >> biohpc cloud: user guide
 

BioHPC Cloud:
: User Guide

 


BioHPC Cloud Software

There is 503 software titles installed in BioHPC Cloud. The sofware is available on all machines (unless stated otherwise in notes), complete list of programs is below, please click on a title to see details and instructions. Tabular list of software is available here

Please read details and instructions before running any program, it may contain important information on how to properly use the software in BioHPC Cloud.

, 454 gsAssembler or gsMapper, a5, ABRicate, ABruijn, ABySS, AdapterRemoval, Admixtools, Admixture, albacore, Alder, AlleleSeq, ALLMAPS, ALLPATHS-LG, AMOS, AMPHORA, analysis, ANGSD, Annovar, antiSMASH, apollo, Arlequin, Atlas-Link, ATLAS_GapFill, ATSAS, Augustus, AWS command line interface, axe, bamtools, Basset, BayeScan, Bayescenv, BBmap, BCFtools, bcl2fastq, BCP, Beagle, Beagle4, Beast2, bedops, BEDtools, bfc, bgc, bigQF, bigWig, biobambam, Bioconductor, biom-format, BioPerl, BioPython, Birdsuite, Bismark, blasr, BLAST, blast2go, BLAT, BMGE, bmtagger, Boost, Bowtie, Bowtie2, BPGA, BreedingSchemeLanguage, breseq, BSseeker2, BUSCO, BWA, bwa-meth, canu, CAP3, cBar, CBSU RNAseq, cd-hit, CEGMA, CellRanger, centrifuge, CFSAN SNP pipeline, CheckM, chromosomer, Circlator, Circos, Circuitscape, CLUMPP, Clustal Omega, CLUSTALW, Cluster, cmake, CNVnator, compat, CONCOCT, copyNumberDiff, cortex_var, CrossMap, CRT, cuda, Cufflinks, cutadapt, dadi, dadi-1.6.3_modif, dDocent, DeconSeq, deepTools, defusion, delly, destruct, DETONATE, diamond, Discovar, Discovar de novo, distruct, Docker, dREG, dREG.HD, Drop-seq, dropSeqPipe, dsk, ea-utils, ecopcr, EDGE, edirect, EIGENSOFT, EMBOSS, entropy, ephem, ermineJ, ete3, exabayes, exonerate, eXpress, FALCON, FALCON_unzip, Fast-GBS, fasta, fastcluster, FastML, fastp, fastq_species_detector, FastQC, fastStructure, FastTree, FASTX, fineRADstructure, fineSTRUCTURE, flash, flash2, Flexible Adapter Remover, FMAP, FragGeneScan, freebayes, FunGene Pipeline, GAEMR, Galaxy, GATK, gatk4, GBRS, gcc, GCTA, gdc-client, GEM library, GEMMA, geneid, GeneMark, GeneMarker, Genome STRiP, GenomeMapper, GenomeStudio (Illumina), GenomicConsensus, gensim, germline, giggle, GMAP/GSNAP, GNU Compilers, GNU parallel, gradle-4.4, graftM, Grinder, GROMACS, GSEA, Gubbins, HapCompass, HAPCUT, HAPCUT2, hapflk, HaploMerger, Haplomerger2, HapSeq2, HarvestTools, HiC-Pro, HiCExplorer, HISAT2, HMMER, Homer, HOTSPOT, HTSeq, HUMAnN2, hyperopt, HyPhy, iAssembler, IBDLD, IDBA-UD, IDP-denovo, IgBLAST, IGV, IMa2, IMa2p, IMAGE, impute2, INDELseek, infernal, InStruct, InteMAP, InterProScan, ipyrad, IQ-TREE, iRep, java, jbrowse, jellyfish, JoinMap, julia, jupyter, kallisto, Kent Utilities, khmer, kSNP, kWIP, LACHESIS, LAST, lcMLkin, LDAK, leeHom, Lep-MAP3, LINKS, LocusZoom, longranger, LUCY, LUCY2, LUMPY, lyve-SET, MACS, MaCS simulator, MACS2, MAFFT, mafTools, Magic-BLAST, MAKER, MAQ, MASH, MaSuRCA, Mauve, MaxBin, mccortex, megahit, MeGAMerge, MEGAN, MELT, MEME Suite, MERLIN, MetaBAT, metaCRISPR, MetAMOS, MetaPathways, MetaPhlAn, MetaVelvet, MetaVelvet-SL, Migrate-n, mira, miRDeep2, MISO (misopy), MixMapper, MKTest, MMAP, MMSEQ, mothur, MrBayes, mrsFAST, msld, MSMC, msprime, MSR-CA Genome Assembler, msstats, MSTMap, mugsy, MultiQC, MUMmer, muscle, MUSIC, muTect, ncftp, Nemo, Netbeans, NEURON, new_fugue, NextGenMap, NGS_data_processing, NGSadmix, ngsDist, ngsF, ngsTools, NGSUtils, NLR-Parser, Novoalign, NovoalignCS, Oases, OBITools, Orthomcl, PacBioTestData, PAGIT, PAML, pandas, pandaseq, PanPhlAn, Panseq, Parsnp, PASA, PASTEC, PAUP*, pbalign, pbh5tools, PBJelly, PBSuite, PeakRanger, PeakSplitter, PEAR, PennCNV, PfamScan, PGDSpider, ph5tools, Phage_Finder, PHAST, PHRAPL, PHYLIP, PhyloCSF, phylophlan, PhyML, Picard, Pilon, Pindel, piPipes, PIQ, PlasFlow, Platypus, plink, Plotly, popbam, prinseq, prodigal, progressiveCactus, prokka, pyani, pyfaidx, pyGenomeTracks, PyMC, pyRAD, Pyro4, PySnpTools, PyTorch, PyVCF, QIIME, QIIME2 q2cli, QTCAT, Quake, Qualimap, QuantiSNP2, QUAST, QUMA, R, RACA, racon, RADIS, RAPTR-SV, RAxML, Ray, Rcorrector, RDP Classifier, REAPR, RepeatMasker, RepeatModeler, RFMix, RGI, RNAMMER, rnaQUAST, Roary, Rqtl, Rqtl2, RSEM, RSeQC, RStudio, rtfbs_db, sabre, SaguaroGW, salmon, Sambamba, samblaster, SampleTracker, Samtools, Satsuma, Satsuma2, scikit-learn, Scoary, scythe, selscan, Sentieon, SeqPrep, sgrep, sgrep sorted_grep, SHAPEIT, shore, SHOREmap, shortBRED, SHRiMP, sickle, SignalP, simuPOP, sistr_cmd, skewer, SLiM, smcpp, SMRT Analysis, SMRT LINK, snakemake, snap, SNAPP, snATAC, SNeP, snippy, SNPhylo, SOAP2, SOAPdenovo, SOAPdenovo-Trans, SOAPdenovo2, SomaticSniper, sorted_grep, SPAdes, SparCC, SRA Toolkit, srst2, stacks, Stacks 2, stampy, STAR, statmodels, STITCH, STPGA, Strelka, stringMLST, StringTie, STRUCTURE, supernova, SURPI, sutta, SVDetect, svtools, SweepFinder, sweepsims, tabix, Tandem Repeats Finder (TRF), TASSEL 3, TASSEL 4, TASSEL 5, tcoffee, TensorFlow, TEToolkit, TMHMM, tmux, TopHat, traitRate, Trans-Proteomic Pipeline (TPP), TransComb, TransDecoder, transrate, TRAP, treeCl, treemix, trimmomatic, Trinity, Trinotate, tRNAscan-SE, UCSC Kent utilities, UMI-tools, Unicycler, unrar, usearch, Variant Effect Predictor, VarScan, vcf2diploid, vcfCooker, vcflib, vcftools, Velvet, VESPA, ViennaRNA, VIP, VirSorter, VirusDetect, VirusFinder 2, VizBin, vsearch, WASP, wgs-assembler (Celera), Wise2 (Genewise), Xander_assembler, yaha

Details for MAKER (hide)

Name:MAKER
Version:2.31.9
OS:Linux
About:MAKER is an easy-to-configure, portable genome annotation pipeline. MAKER allows smaller eukaryotic genome projects and prokaryotic genome projects to annotate their genomes and to create genome databases.
Added:1/23/2013 4:46:28 PM
Updated:12/19/2017 4:31:30 PM
Link:http://gmod.org/wiki/MAKER
Notes:

Maker is slow, and it could take up to a week to finish a maker run with MPI, and much longer without MPI. It is recommended then you run maker with MPI, and use medium memory generation 2 (for medium size job) or large memory generation 2 computer (for large genome)  on BioHPC. If a job is stopped during the run, it can be restarted from where it stops. 

 

Here is the instruction to run maker with MPI. You need to copy maker software to the local /workdir.

cd /workdir
cp -rH /programs/maker /workdir

export PATH=/workdir/maker/bin:$PATH

 

## create a working directory for your data, and put your genome fasta file in your working directory

mkdir /workdir/MyUserID

cd /workdir/MyUserID

 

## create configure file

maker -CTL   

 

## create a tmp directory

mkdir /workdir/tmp

 

## Modify the maker_opts.ctl file created by the previous command

#1. modify the last line of the maker_opts.ctl, so that it looks like: TMP=/workdir/tmp 

#2. modify the parameters in maker_opts.ctl and maker_bopts.ctl  that are appropriate for your project

 

##: Run mpi maker.

## Note: As maker take several days to finish, you need to run maker with "screen" or "nohup" so that the job will keep running in the background. 

## The following instruction uses "screen" (Documentation for "screen": https://www.rackaid.com/blog/linux-screen-tutorial-and-how-to/ ) :

screen

export PATH=/workdir/maker/bin:$PATH

which maker          #The step is to confirm that you are using maker on /workdir

 

## before running the next step, check how many CPU cores that your reserved computer has. You can find the information at "My Reservation" page of the  BioHPC web site under "System info". Replace "N" in the next command with the number of cores.

 /usr/local/mpich/bin/mpiexec -n N maker -qq  >& log &

 

## press "ctrl-a" followed by press "d" to detach from "screen", now you can safely close your laptop.

## to re-attach to "screen", use the command "screen -r".

 

Please note, the maker software created large number of intermediate files. With our current file system, it is not recommended to save large number of small files in home directory.   If you want to save the maker results from /workdir to your home directory, make sure you make a tarball, and copy the tar file to home directory. For example: 

 

tar -cvfz my.maker.output.tar.gz my.maker.output

When you want to work on the directory later, you can copy the .tar.gz  file to work directory, do 

tar -xvfz my.maker.output.tar.gz


Notify me if this software is upgraded or changed [You need to be logged in to use this feature]

 

Website credentials: login  Web Accessibility Help