scranpy.aggregation package

Submodules

scranpy.aggregation.aggregate_across_cells module

class scranpy.aggregation.aggregate_across_cells.AggregateAcrossCellsOptions(compute_sums=True, compute_detected=True, assay_type=0, num_threads=1)

Bases: object

Options to pass to aggregate_across_cells().

compute_sums: Whether to compute the sum of expression values in each group.

compute_detected: Whether to compute the number of cells with detected expression in each group.

assay_type: Assay to use from input if it is a SummarizedExperiment.

num_threads: Number of threads to use.

__annotations__ = {'assay_type': typing.Union[int, str], 'compute_detected': <class 'bool'>, 'compute_sums': <class 'bool'>, 'num_threads': <class 'int'>}


__eq__(other): Return self==value.

__hash__ = None

__repr__(): Return repr(self).

assay_type: Union[int, str] = 0

compute_detected: bool = True

compute_sums: bool = True

num_threads: int = 1
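
The `__eq__` and `__hash__ = None` entries above are the usual behavior of a mutable dataclass: instances compare field by field and cannot be hashed. A minimal sketch of what that means in practice, using a hypothetical stand-in class (`OptionsSketch` is not part of scranpy; it only mirrors the documented fields and defaults so the example runs without the package installed):

```python
from dataclasses import dataclass, replace
from typing import Union


@dataclass
class OptionsSketch:
    # Hypothetical stand-in mirroring the documented fields and defaults.
    compute_sums: bool = True
    compute_detected: bool = True
    assay_type: Union[int, str] = 0
    num_threads: int = 1


defaults = OptionsSketch()

# Override selected fields while keeping the remaining defaults.
custom = replace(defaults, compute_detected=False, num_threads=4)

# Equality is evaluated field by field, not by object identity.
same_as_defaults = OptionsSketch() == defaults

# A dataclass with eq=True and frozen=False sets __hash__ to None,
# so instances cannot be used as dictionary keys or set members.
try:
    hash(defaults)
    hashable = True
except TypeError:
    hashable = False
```

With the real class, the same pattern would presumably be `AggregateAcrossCellsOptions(compute_detected=False, num_threads=4)`, using the keyword arguments shown in the signature.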

scranpy.aggregation.aggregate_across_cells.aggregate_across_cells(input, groups, options=AggregateAcrossCellsOptions(compute_sums=True, compute_detected=True, assay_type=0, num_threads=1))

Aggregate expression values for groups of cells.

Parameters:

input (Union[TatamiNumericPointer, SummarizedExperiment]) –
Matrix-like object where rows are features and columns are cells, typically containing expression values of some kind. This should be a matrix class that can be converted into a TatamiNumericPointer.

Alternatively, a SummarizedExperiment containing such a matrix in its assays.

Developers may also provide a TatamiNumericPointer directly.
groups (Union[Sequence, Tuple[Sequence], dict, BiocFrame]) – A sequence of length equal to the number of columns of input, specifying the group to which each column is assigned. Alternatively, a tuple, dictionary, or BiocFrame of one or more such sequences, in which case each unique combination of levels across all sequences is defined as a “group”.
options (AggregateAcrossCellsOptions) – Further options.

Return type:

SummarizedExperiment

Returns:

A SummarizedExperiment where each row corresponds to a row in input and each column corresponds to a group. The assays contain, for each group, the sum of expression values (if options.compute_sums = True) and the number of cells with detected expression (if options.compute_detected = True). The column data contains the identity of each group; if groups contains multiple sequences, each group is identified by its unique combination of levels, one from each sequence.
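
To make the grouping rule concrete, the following is a minimal pure-Python sketch of the computation described above. It is not scranpy's implementation: `aggregate_sketch` is a hypothetical helper, the input is a plain list of rows rather than a TatamiNumericPointer, and "detected" is assumed to mean a value greater than zero.

```python
def aggregate_sketch(matrix, *factors):
    """Illustrative stand-in, not scranpy's implementation.

    matrix: list of rows (features), each a list of per-cell values.
    factors: one or more sequences, each with one entry per cell (column).
    Returns (levels, sums, detected), with one column per group.
    """
    if not factors:
        raise ValueError("at least one grouping sequence is required")
    num_cells = len(matrix[0]) if matrix else 0
    for factor in factors:
        if len(factor) != num_cells:
            raise ValueError("each grouping sequence needs one entry per column")

    # Each unique combination of levels across all factors defines a group.
    combos = list(zip(*factors))
    levels = sorted(set(combos))
    index = {combo: i for i, combo in enumerate(levels)}

    sums = [[0] * len(levels) for _ in matrix]
    detected = [[0] * len(levels) for _ in matrix]
    for r, row in enumerate(matrix):
        for c, value in enumerate(row):
            g = index[combos[c]]
            sums[r][g] += value
            # Assumption: a cell counts as "detected" when its value is positive.
            if value > 0:
                detected[r][g] += 1
    return levels, sums, detected


counts = [
    [1, 0, 2, 3],  # feature 0 across four cells
    [0, 0, 5, 0],  # feature 1 across four cells
]
cluster = ["A", "A", "B", "B"]
batch = ["x", "y", "x", "x"]
levels, sums, detected = aggregate_sketch(counts, cluster, batch)
# levels   == [("A", "x"), ("A", "y"), ("B", "x")]
# sums     == [[1, 0, 5], [0, 0, 5]]
# detected == [[1, 0, 2], [0, 0, 1]]
```

Only combinations that actually occur become groups, so the two sequences here yield three columns rather than four.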

scranpy.aggregation.downsample_by_neighbors module

class scranpy.aggregation.downsample_by_neighbors.DownsampleByNeighborsOptions

Bases: object

Options to pass to downsample_by_neighbors().

num_threads: Number of threads to use.

__annotations__ = {'num_threads': <class 'int'>}

num_threads: int = 1

scranpy.aggregation.downsample_by_neighbors.downsample_by_neighbors(input, k, options=<scranpy.aggregation.downsample_by_neighbors.DownsampleByNeighborsOptions object>)

Downsample a dataset by its neighbors. We do this by considering a cell to be a “representative” of its nearest neighbors, which allows us to downsample by removing all of those neighbors. This is repeated until every cell is assigned to a representative, starting from the cells in the densest part of the dataset and working down to the sparsest. The approach aims to preserve the relative density of points, giving a faithful downsampling while still guaranteeing that rare subpopulations are represented.
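
The procedure can be sketched in pure Python as follows. This is a simplified illustration under stated assumptions, not scranpy's implementation: `downsample_sketch` is a hypothetical helper, neighbors are found by brute force, and density is approximated by the distance to the k-th nearest neighbor (smaller means denser).

```python
import math


def downsample_sketch(points, k):
    """Illustrative stand-in, not scranpy's implementation.

    points: list of coordinate tuples, one per cell.
    k: number of nearest neighbors each cell may represent.
    Returns (representatives, assignment), where assignment[i] is the
    index of the representative chosen for cell i.
    """
    n = len(points)
    if not 0 < k < n:
        raise ValueError("k must be positive and smaller than the number of cells")

    neighbors = []
    kth_distance = []
    for i in range(n):
        # Brute-force neighbor search; suitable for a toy example only.
        ranked = sorted(
            (math.dist(points[i], points[j]), j) for j in range(n) if j != i
        )[:k]
        neighbors.append([j for _, j in ranked])
        kth_distance.append(ranked[-1][0])

    # Assumption: a small distance to the k-th neighbor marks a dense region,
    # so cells are visited from densest to sparsest (ties broken by index).
    order = sorted(range(n), key=lambda i: (kth_distance[i], i))

    assignment = [None] * n
    representatives = []
    for i in order:
        if assignment[i] is not None:
            continue  # already covered by an earlier representative
        assignment[i] = i
        representatives.append(i)
        for j in neighbors[i]:
            if assignment[j] is None:
                assignment[j] = i
    return representatives, assignment


# Four tightly packed cells plus one isolated cell far away.
cells = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0)]
representatives, assignment = downsample_sketch(cells, k=3)
# representatives == [0, 4]
# assignment      == [0, 0, 0, 0, 4]
```

The dense cluster collapses to a single representative, while the isolated cell is kept because none of the earlier representatives claimed it, which is the behavior the description attributes to rare subpopulations.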