Skip to content
Snippets Groups Projects
intro.rst 1.86 KiB
Newer Older
Introduction and setup
=======================

MHCflurry is a peptide/MHC I binding affinity prediction package written in
Python. It aims to provide state of the art accuracy in a documented, fast, and
open source implementation.

MHCflurry users may download trained predictors fit to affinity measurements
deposited in IEDB. The complete workflow to generate these models
is available in the "downloads_generation/models_class1" directory in the
repository. It is also easy for users with their own data to fit their own models.

Currently only allele-specific prediction is implemented, in which separate models
are trained for each allele. The released models therefore support a fixed set of common
class I alleles for which sufficient published training data is available.

MHCflurry supports Python versions 2.7 and 3.4+. It uses the Keras neural
network library via either the Tensorflow or Theano backends. GPUs may
optionally be used for a generally modest speed improvement.

If you find MHCflurry useful in your research please cite:

    O'Donnell, T. et al., 2017. MHCflurry: open-source class I MHC
    binding affinity prediction. bioRxiv. Available at:
    http://www.biorxiv.org/content/early/2017/08/09/174243.


Installation (pip)
-------------------

Install the package:

::

    pip install mhcflurry

Then download our datasets and trained models:

::

    mhcflurry-downloads fetch

From a checkout you can run the unit tests with:

::

    pip install nose
    nosetests .


Using conda
-------------

You can alternatively get up and running with a `conda <https://conda.io/docs/>`__
environment as follows. Some users have reported that this can avoid problems installing
tensorflow.

::

    conda create -q -n mhcflurry-env python=3.6 'tensorflow>=1.1.2'
    source activate mhcflurry-env

Then continue as above:

::

    pip install mhcflurry
    mhcflurry-downloads fetch