multiprocessing on SciComp Blog

multiprocessing on SciComp Blog https://mpievolbio-scicomp.pages.gwdg.de/blog/tags/multiprocessing/ Recent content in multiprocessing on SciComp Blog Hugo -- gohugo.io en Fri, 22 Jan 2021 00:00:00 +0000 Dask and Jupyter https://mpievolbio-scicomp.pages.gwdg.de/blog/post/2021-01-22_dask/ Fri, 22 Jan 2021 00:00:00 +0000 https://mpievolbio-scicomp.pages.gwdg.de/blog/post/2021-01-22_dask/ Parallel python with dask and jupyter The dask framework provides an incredibly useful environment for parallel execution of python code in interactive settings (e.g. jupyter) or batch mode. Its key features are (from what I’ve seen so far): Representation of threading, multiprocessing, and distributed computing with one unified API and CLI. Abstraction of HPC schedulers (PBS, Moab, SLURM, …) Data structures for distributed computing with pandas and numpy syntax Dask-jobqueue The package dask_jobqueue seems to me to be the most userfriendly if it comes to parallelization on HPC clusters with a scheduling system such as SLURM.