Skip to content
代码片段 群组 项目
README.md 4.5 KB
Newer Older
Aron Eklund's avatar
Aron Eklund 已提交
# MuPeXI: Mutant Peptide eXtractor and Informer #
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
Given a list of somatic mutations (VCF file) as input, MuPeXI returns a table containing
all mutated peptides (neo-peptides) of user-defined lengths, along with several pieces
of information relevant for identifying which of these neo-peptides are likely to serve as
Aron Eklund's avatar
Aron Eklund 已提交
neo-epitopes. 
ambj's avatar
ambj 已提交

NEW: MuPeXI is now tested and compatible for suquencing data of murine origin.

Aron Eklund's avatar
Aron Eklund 已提交
#### Authors: 
Anne-Mette Bjerregaard and Aron C. Eklund 
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
#### License: 
MuPeXI is freely available for academic users (see [License](/LICENSE)) ,  other users are requested to contact CBS Software Package Manager at software@cbs.dtu.dk.
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
#### Citation:
##### Original paper
Aron Eklund's avatar
Aron Eklund 已提交
Bjerregaard AM, Nielsen M, Hadrup SR, Szallasi Z, Eklund AC.  
Aron Eklund's avatar
Aron Eklund 已提交
MuPeXI: Prediction of neo-epitopes from tumor sequencing data.  
Cancer Immunol Immunother. 2017 Apr 20. doi: 10.1007/s00262-017-2001-3.  
Aron Eklund's avatar
Aron Eklund 已提交
PubMed ID: [28429069](https://www.ncbi.nlm.nih.gov/pubmed/28429069)  
You can read the paper here: http://rdcu.be/rwVP
ambj's avatar
ambj 已提交

##### Murine compatible version  
For use of the murine species option additonal citation of the following would be highly appreciated. 
Bjerregaard AM, Pedersen TK, Marquard AM, Hadrup SR.
Prediction of neoepitopes from murine sequencing data.
Cancer Immunol Immunother. 2018 Oct 05. doi: 10.1007/s00262-018-2254-5.
PubMed ID: [30291365](https://www.ncbi.nlm.nih.gov/pubmed/30291365)
You can read the paper here: https://rdcu.be/8ExV
#### Web servers:
Aron Eklund's avatar
Aron Eklund 已提交
For limited data, MuPeXI can be run on our
human specific [web server](http://www.cbs.dtu.dk/services/MuPeXI/)
and mouse specific [web server](http://www.cbs.dtu.dk/services/MuPeXI-mouse/)
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
## Dependencies
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
#### Hardware:
Aron Eklund's avatar
Aron Eklund 已提交
MuPeXI currently runs only on x86_64 machines running Linux or Darwin.
ambj's avatar
ambj 已提交

#### Required software:
* [Python 2.7](https://www.python.org/download/releases/2.7/)
* [NetMHCpan 4.0](http://www.cbs.dtu.dk/cgi-bin/nph-sw_request?netMHCpan)
Aron Eklund's avatar
Aron Eklund 已提交
* [Variant Effect Predictor (VEP)](http://www.ensembl.org/info/docs/tools/vep/index.html) 
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
#### Required Python modules:
ambj's avatar
ambj 已提交
* [Biopython](http://biopython.org/wiki/Download)
* [numpy](http://www.numpy.org/)
* [pandas](http://pandas.pydata.org/)

Aron Eklund's avatar
Aron Eklund 已提交
Note: These modules are already included if using Python through
Aron Eklund's avatar
Aron Eklund 已提交
[Anaconda](https://www.continuum.io/downloads).
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
#### Optional software, required only for liftover from HG19
Aron Eklund's avatar
Aron Eklund 已提交
* [Picard tools](https://broadinstitute.github.io/picard/)
* [Java 8](https://java.com/en/download/help/linux_x64rpm_install.xml)
ambj's avatar
ambj 已提交


## Installation  

Aron Eklund's avatar
Aron Eklund 已提交
1. Install all software listed above.
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
2. Download or clone the MuPeXI repository to your local system
ambj's avatar
ambj 已提交

        git clone https://github.com/ambj/MuPeXI.git

Aron Eklund's avatar
Aron Eklund 已提交
3. Obtain the reference files from GRCh38. These include cDNA, peptide and COSMIC
files; see the References section in the [user manual](/doc/MuPeXI_User_Manual.md)
for a detailed description.
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
4. Fill in the config.ini file  
    * Provide the full path to NetMHCpan and VEP.
Aron Eklund's avatar
Aron Eklund 已提交
    * Provide the full path to the reference files:
ambj's avatar
ambj 已提交
        - cDNA
        - peptide
Aron Eklund's avatar
Aron Eklund 已提交
        - COSMIC
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
   Additional peptide references and liftover paths can be provided in the config.ini
   file; see the user manual for detailed information. Instructions on how to fill in 
   the config.ini file are found within the file. `config.ini` is automatically found if 
   it is in the same directory as `MuPeXI.py` script, but it can also be placed elsewhere
   and specified by the `-c` option. 
ambj's avatar
ambj 已提交


## Usage  

Aron Eklund's avatar
Aron Eklund 已提交
Here is a simple example in which somatic mutation calls and gene expression data are
provided, and MHC binding is predicted for HLA types HLA-A01:01 and HLA-B08:01. 
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
    path/to/MuPeXI.py -v mutations.vcf -a HLA-A01:01,HLA-B08:01 -e expression.tsv
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
MuPeXI can be used for both peptide extraction, giving immunogenicity information for
peptide selection (the default), and for generation of a FASTA-formatted mutant-peptide
file suitable for input to mass spectrometry peptide search software (with the `-f` 
option). 
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
All options can be displayed using the usage information with the `-h` option:   
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
    path/to/MuPeXI.py -h
Aron Eklund's avatar
Aron Eklund 已提交
## User Manual 
For detailed information about usage, input and output files, test examples and data
preparation read the [MuPeXI User Manual](/doc/MuPeXI_User_Manual.md)
Aron Eklund's avatar
Aron Eklund 已提交
## FAQs
ambj's avatar
ambj 已提交

Aron Eklund's avatar
Aron Eklund 已提交
* We pronounce it moo-PECKS-ee
ambj's avatar
ambj 已提交


## Contact   

Anne-Mette Bjerregaard  
ambj's avatar
ambj 已提交
ambj@bioinformatics.dtu.dk
ambj's avatar
ambj 已提交

or 

Aron Charles Eklund  
ambj's avatar
ambj 已提交
eklund@bioinformatics.dtu.dk
ambj's avatar
ambj 已提交


Department of Bio and Health Informatics  
Technical University of Denmark  
Aron Eklund's avatar
Aron Eklund 已提交
http://www.bioinformatics.dtu.dk/english


## Algorithmic Flow Chart  

![](/doc/Mupexi_flow_chart.png)