NanoPypes
==========

NanoPypes is a python package for managing and analysing ONT sequence data using distributed computing environments.


Quick-Start
===========

Install
------------
pip.::

    pip install nanopypes


Parallel basecalling with ONT's Albacore- pure python
-------------------------------------------------------
Create an instance of Albacore.::

    from nanopypes.albacore import *

    config = "path/to/yaml/config"
    albacore = Albacore(config)

Create an instance of Cluster to connect to your compute resources.::

    cluster = Cluster(config)

Execute the basecall function.::

    from nanopypes.nanopypes import basecall

    basecall(albacore, cluster)

Parallel basecalling with ONT's Albacore- command line
-------------------------------------------------------
Single command- pass a config and/or individual parameters.::

    parallel_basecaller path/to/yaml/config

For a full list of parameters.::

    parallel_basecaller --help


Building the yaml config file.
------------------------------
Create a .yml file with the following parameters.::

    basecall_config:
        input_path: path/to/raw/minion/data
        save_path: path/to/basecall/output
        flowcell: FLO-MIN106 # Flow cell used in sequencing run
        kit: SQK-LSK109 # Kit used in sequencing run
        barcoding: False
        output_format: fast5
        worker_threads: 1 # Albacore worker_threads- best to keep at 1
        recursive: False # all data will be non-recursive when Albacore is called
        reads_per_fastq: 1000
        cluster:
            job_time: 06:00  # May need optimizing based on number of reads
            mem: 40000 # Should be scaled with ncpus, workers and cores
            ncpus: 20 #1 worker per cpu
            project: /path/to/project/on/cluster
            queue: long
            workers: 20
            cores: 20
            memory: 40 GB
            scale_value: 100  # 100 scale_value / 20 cpus = 5 jobs
            cluster_type: LSF


.. toctree::
   :maxdepth: 2
   :caption: Contents:

   readme
   installation
   usage
   modules
   contributing
   authors
   history

Indices and tables
==================
* :ref:`genindex`
* :ref:`modindex`
* :ref:`search`