BiocPy: Facilitate Bioconductor Workflows in Python

Author
Published

January 14, 2026

Welcome to BiocPy

TipLooking for the older website?

A previous version of this book is published here and the Bioconductor workshop here (from 2024).

BiocPy is designed to build a bridge between the mature Bioconductor ecosystem and the Python landscape. Bioconductor is an open-source software project that provides tools for the analysis and comprehension of genomic data. One of the main advantages of Bioconductor is the availability of standard data representations and large number of analysis tools tailored for genomic experiments. These data structures allow researchers to seamlessly store, manipulate, and analyze data across multiple packages and workflows in R.

Inspired by Bioconductor, BiocPy aims to facilitate Bioconductor workflows in Python. To achieve this goal, we developed several core data structures that align closely to the Bioconductor implementations. By implementing these core Bioconductor data structures, BiocPy allows data to be easily interoperable between R and Python.

About this Book

This book is currently in active development and is organized into the following sections:

  • Foundations: The core data structures that underpin the ecosystem.
  • BioC Hubs: Access to Bioconductor’s cloud resources.
  • Interoperability: Tools to bridge R and Python.
  • Workflows: Real-world usage examples.

Further Reading

Many online resources offer detailed information on Bioconductor data structures, namely: