BioHPC Lab Software

There are 391 software titles installed in BioHPC Lab.

Please read the details and instructions before running any program; they may contain important information on how to use the software properly in BioHPC Lab.

InterProScan

About: InterProScan is a bioinformatics tool that provides a one-stop-shop for automated sequence analysis of both protein and nucleic acid, the latter via a full six-frame translation. It offers the ability to identify both structural and functional regions of interest, based upon methods and models that have been generated by a large number of member groups ('member databases').
Added: 9/20/2013 4:28:14 PM
Updated: 3/23/2016 10:48:02 AM

InterProScan must be unpacked before use. Go to your directory under /workdir:
cd /workdir/mydir
and then execute
tar -xf /shared_data/genome_db/interproscan.tar
Your copy of InterProScan will be in the subdirectory interproscan of the directory where you ran the command.
To run the program please use its full path

Manual of interproscan:

To get help:

Sample command ("-t n" indicates that the FASTA file contains DNA sequences; "-f XML" writes the output in XML format. InterProScan requires Java 1.8):

/workdir/mydir/interproscan/ -b out -f XML -i test.fasta --goterms --pathways --iprlookup -t n

***INTERPROSCAN is very slow. It could take ~5 minutes per gene. To speed it up:

a. You can split your query FASTA file into multiple files and reserve multiple BioHPC computers to process each file.

b. After you untar the interproscan.tar file, modify the configuration file and change number.of.embedded.workers and maxnumber.of.embedded.workers. The maxnumber value should be [total cores on the computer] - 1. The number value should be slightly less than the maxnumber value.
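
The two speed-up options (a) and (b) can be sketched with a pair of small shell helpers. This is only an illustration under stated assumptions, not a BioHPC-provided tool: the function names split_fasta and suggest_workers and the chunk file naming are hypothetical, and "slightly less than maxnumber" is taken here to mean one less. The second helper only prints suggested values; it does not edit any file.

```shell
# Hypothetical helper for option (a): split a FASTA file into N chunk files,
# distributing the records round-robin so the chunks are of similar size.
# Usage: split_fasta INPUT.fasta N OUT_PREFIX
# Writes OUT_PREFIX_1.fasta ... OUT_PREFIX_N.fasta
split_fasta() {
    local input=$1 chunks=$2 prefix=$3
    awk -v n="$chunks" -v prefix="$prefix" '
        # A header line starts a new record: pick the chunk file for it.
        /^>/ { out = sprintf("%s_%d.fasta", prefix, seq % n + 1); seq++ }
        # Write the header and its sequence lines to the current chunk;
        # anything before the first header is skipped.
        out != "" { print > out }
    ' "$input"
}

# Hypothetical helper for option (b): print suggested worker settings.
# Usage: suggest_workers [TOTAL_CORES]   (defaults to the output of nproc)
# Assumption: the number value is set to maxnumber - 1.
suggest_workers() {
    local cores=${1:-$(nproc)}
    local max=$((cores - 1))
    local num=$((max - 1))
    # Never suggest fewer than one worker on small machines.
    if [ "$max" -lt 1 ]; then max=1; fi
    if [ "$num" -lt 1 ]; then num=1; fi
    echo "number.of.embedded.workers=$num"
    echo "maxnumber.of.embedded.workers=$max"
}
```

For example, running split_fasta test.fasta 4 chunk in /workdir/mydir would write chunk_1.fasta through chunk_4.fasta, each of which can then be processed on a separate reserved computer. Running suggest_workers on the machine that will run InterProScan prints two lines that can be copied by hand into the configuration.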

There is a local InterProScan Lookup Service set up on port 8082. This is the default lookup service for our InterProScan installation.

