Release Notes#
Version 1.2.0#
1.2.0 : 2024-03-30#
Features:
st.io.read_gem
andst.io.read_gef
support expression matrix files with geneID information.Analysis results of
find_marker_genes
will be saved into the output AnnData h5ad.Upgraded tissue segmentation algorithm.
Addition of
st.tl.adjusted_rand_score
to calculate the adjusted Rand coefficient between two clusters.Addition of
st.tl.silhouette_score
to calculate the average silhouette coefficient of a cluster.h5ad2rds.R
is compatible with AnnData version > 0.7.5, to convert from h5ad to rds files.Addition of the clustering category labels to the graph of
st.plt.paga_compare
.
BUG Fixes:
Fixed the error of high memory consumption when converting
X.raw
into AnnData.
Version 1.1.0#
1.1.0 : 2024-01-17#
Features:
Reconstructed
st.plt.violin
visualizing function which is now not only applied to display QC indicators;ins.export_high_res_area
can handle expression matrix and image simultaneously, to lasso region of interest and corresponding sub-image.Interactive visualizing
st.plt.cells_plotting
supported displaying expression heatmap and spatial distribution of a single gene.When input GEF and GEM at cell level, information of DNB count and cell area would be added into
cells
/obs
, and cell border would be added intocells_matrix
/obsm
.
BUG Fixes:
slideio
package removed historical versions, resulting in an installation failure.Calculating error when performing
ms_data.tl.batch_qc
, due to abnormalos.getlogin
.st.plt.paga_time_series_plot
indicated that the image was too large to draw, due to unprocessed boundary values when computing median.
Version 1.0.0#
1.0.0 : 2023-12-04#
Features:
Addition of GPU acceleration on SinlgeR for large-volume data, and optimized calculating based on CPU version.
Addition of
st.plt.elbow
to visualize PCA result, for appropriate number of pcs.Addition of color, max, min setting for colorbar, when plotting heatmap.
Addition of cell segmentation of
Deep Learning Model V1_Pro
, which is improved based onV1
.Supplemented parameters of
st.plt.auc_heatmap
andst.plt.auc_heatmap_by_group
, full access toseaborn.clustermap
;Addition of thread and seed setting in
st.tl.umap
, of which the default method have been changed to single thread with the sacrifice of computational efficiency to ensure reproducibility of results. More in https://umap-learn.readthedocs.io/en/latest/reproducibility.html.Modification of computing method of bin coordinates when reading GEM, consistent with GEF.
Optimized
st.io.stereo_to_anndata
for efficient format conversion.Renamed
st.tl.spatial_alignment
function asst.tl.paste
.export_high_res_area
removed parametercgef
.
BUG Fixes:
Occasional square-hollowing area in
Deep Learning Model V3
of cell segmentation processing.st.tl.annotation
could not set two or more clusters as a same name.The data object
ins.selected_exp_data
obtained fromst.plt.interact_spatial_scatter
could not be used for subsequent analysis.Part of data was missing when performed
st.plt.interact_spatial_scatter
to output high-resolution matrix in GEF format.Some files met reading error, led by no default setting of
bin_type
andbin_size
inst.io.read_h5ms
.Error in Batch QC calculation due to data type problem.
There is NaN in Cell Community Detection output after threshold filtering, resulting in a calculating error when performed Find marker genes based on it.
st.plt.paga_time_series_plot
indicated the image is too large to draw, leading to graph overlap, due to the limitation of matplotlib package.
Version 0.14.0b1 (Beta)#
0.14.0b1 : 2023-9-15#
Notice: this Beta version is specifically developed for multi-sample analysis.
Features:
Addition of Cell Community Detection (CCD) analysis.
Addition of Cell Co-occurrence analysis.
Addition of Cellpose in cell segmentation, especially for cell cytoplasm using
model_type='cyto2'
.Addition of circos (
st.plt.ccc_circos_plot
) and sankey (st.plt.ccc_sankey_plot
) plots in Cell-cell Communication analysis.Addition of volcano (
st.plt.TVG_volcano_plot
) and tree (st.plt.time_series_tree_plot
) plots in Time Series analysis.Addition of PAGA tree plot,
st.plt.paga_plot
.Addition of visuallization of
st.tl.dendrogram
.Addition of version check using
st.__version__
.Supported obtain subset from a data object, using clustering output, by
st.tl.filter_by_clusters
.Supported filtering data using hvgs, by
st.tl.filter_by_hvgs
.Supported mapping the clustering result of SquareBin analysis to the same data but in CellBin.
Supported writing annotation information into CellBin GEF file, only clustering result available before.
Supported saving images of PNG and PDF formats, in interactive interface.
Optimized the function of
st.tl.find_marker_genes
.Optimized the modification of titles in horizontal axis, vertical axis and plot.
BUG Fixes:
Fixed the issue that SingleR calculating did not add filtration to the column field when traversing expression matrix, resulting in the subsequent absence of the column index.
Fixed the issue that output Seurat h5ad could not be transformed into R format.
Fixed the issue that clustering output of Leiden was in wrong data type under the scene of GPU acceleration, leading to errors in subsequent analysis which work on the clustering result.
Fixed the issue that clustering result could not be written into GEF file, using
st.io.update_gef
, caused by data type error. From v0.12.1 on,date.cells.cell_name
has changed from int to string.
Version 0.13.0b1 (Beta)#
0.13.0b1 : 2023-07-11#
Notice: this Beta version is specifically developed for multi-sample analysis. Major update points are listed below.
Addition of 3D Cell-cell Communication.
Addition of 3D Gene Regulatory Network.
Addition of Trajectory Inference, including PAGA and DPT algorithms.
Addition of Batch QC function for evaluation on batch effect.
Addition of
st.io.read_h5ad
for improved compatibility with AnnData H5ad, we highly recommend that instead ofst.io.read_ann_h5ad
.Addition of analysis workflow tutorial based on multi-sample data, with assistant parameters
scope
andmode
.Addition of resetting the image order of multi-sample analysis results.
Addition of 3D mesh visualization.
Improved the performance of Gaussian Smoothing.
Version 0.12.1#
0.12.1 : 2023-06-21#
Addition of the pretreatment of calculating quality control metrics at the start of
st.tl.filter_genes
andst.tl.filter_cells
.Fixed the bug that loaded data from GEF file had the same expression matrix but in different row order, through updating gefpy package to v0.6.24.
Fixed the bug that
scale.data
hadnp.nan
value inst.tl.sctransform
, caused by data type limitation.Fixed the bug that dot symbol ( ‘.’ ) caused identification error of cluster name in
.csv
output, when doingst.tl.find_marker_genes
.
Version 0.12.0#
0.12.0 : 2023-04-27#
Addition of the algorithm of Cell Segmentation V3.0.
Addition of
method='hotspot'
tost.tl.regulatory_network_inference
, which takes spatial coordinate information into account to calculate the relative importance between TFs and their target genes.Addition of dpi and width/height setting for visualization, and addition of plotting scale for displaying static plot.
Optimized required memory while plotting UMAP embedding via
data.plt.umap
and cell distribution viadata.plt.cells_plotting
.Fixed bug that input parameter of
var_features_n
was invalid, indata.tl.scTransform
.Updated requirements.txt.
Version 0.11.0#
0.11.0 : 2023-04-04#
Addition of Cell-cell Communication analysis.
Addition of Gene Regulatory Network analysis.
Addition of SingleR function for automatic annotation.
Addition of
v2
algorithm fast cell correction.Addition of dot plot to display gene-level results.
Addition of the sorting function and the limitation of output genes in
data.tl.find_marker_genes
.Added
pct
andpct_rest
to the output files of marker genes.Addition of the parameter
mean_uni_gt
indata.tl.filter_genes
to filter genes on average expression.Fixed the bug that
adata.X
to output AnnData was the raw matrix.Fixed the failed compatibility to analysis results from
.h5ad
(version <= 0.9.0).Updated the tissue segmentation algorithm in the module of cell segmentation to avoid the lack of tissue.
Reconstructed the manual of Stereopy.
Updated requirements.txt.
Version 0.10.0#
0.10.0 :2023-02-22#
Supported installation on Windows.
Addition of displaying basic information of StereoExpData object when simply typing it.
Addition of saving static results plots.
Addition of marker gene proportion (optional), in-group and out-of-group, in
data.tl.find_marker_genes
. Otherwise, supported filtering marker genes viadata.tl.filter_marker_genes
.Supported adapting to AnnData, to directly use data and results stored in AnnData for subsequent analysis.
Addition of the matrix of gene count among clusters so that transformed output
.rds
file could be used for annotation by SingleR directly.Initial release of Stereopy development solution.
Updated requirements.txt.
Version 0.9.0#
0.9.0 : 2023-01-10#
Resolved cell boundary overlapping issues during cell correction visualization.
Addition of manually annotating cells and clusters via command lines or interactive visualization features.
Addition of GPU version of UMAP, Neighbors, Leiden, and Louvian.
Updated requirements.txt.
Version 0.8.0#
0.8.0 : 2022-12-02#
Reconstructed scTransform normalization in Stereopy.
Optimized the efficiency of fast-cell-correction.
Enabled to read Seurat output
.h5ad
file for further analysis.
Version 0.7.0#
0.7.0 : 2022-11-15#
Supported acquiring the cell expression matrix (cellbin) from GEM file.
Updated hotspot to the latest version. Allow to output gene lists for every module.
Allowed to merge and arrange more than two matrices in a row.
Speeded up Stereopy installation and allowed installing heavy frameworks, such as, TensorFlow and PyTorch later before using.
Updated requirements.txt.
Version 0.6.0#
0.6.0 : 2022-09-30#
Added ‘Remove Batch Effect’ algorithm.
Added RNA velocity analysis.
Added
export_high_res_area
method to export high resolution matrix file(cell bin GEF) after lasso operation.Updated algorithm of scale.
Optimized the efficiency of cell correction.
Increased multi-chip fusion analysis.
Updated requirements.txt.
Version 0.5.1#
0.5.1 : 2022-09-4#
Fixed bug when using GEM file to run fast-cell-correction algorithm.
Version 0.5.0#
0.5.0 : 2022-09-2#
Added fast-cell-correction algorithm.
Updated gmm-cell-correction algorithm(slower version), and fixed bug that genes in the same position(bin) were assigned to different cells.
Added
data.plt.cells_plotting
method to display cell details.Added
data.tl.export_high_res_area
method to export high resolution matrix file(GEF) after lasso.Increased tissue_extraction_to_bgef method to extract the tissue area.
Updated algorithm of highly_variable_genes, umap and normalization.
Updated requirements.txt.
Version 0.4.0#
0.4.0 : 2022-07-30#
Updated tissue segmentation algorithm.
Added the
n_jobs
parameter inst.tl.neighbors
andst.tl.phenograph
.Added
st.io.read_gef
function filtered by the list of gene region.Updated requirements.txt.
Version 0.3.1#
0.3.1 : 2022-06-30#
Added gaussian smooth function.
Added the
svd_solver
parameter indata.tl.pca
.Added the
output
parameter inst.io.write_h5ad
.Updated requirements.txt.
Version 0.3.0#
0.3.0 : 2022-06-10#
Added cell bin correction function.
Added
data.tl.scale
function in normalization.Supported writing StereoExpData object into a GEF file.
Fixed bug of scTransform, reading the GEF/GEM file and annh5ad2rds.R.
Updated default cluster groups to start at 1.
Supported writing StereoExpData to stereo
.h5ad
function.Updated requirements.txt.
Version 0.2.4#
0.2.4 : 2022-01-19#
Fixed bug of tar package.
Version 0.2.3#
0.2.3 : 2022-01-17#
Added cell segmentation and tissue segmentation function.
Updated stereo_to_anndata function and supported output to
.h5ad
file.Added the Rscript supporting h5ad file(with anndata object) to rds file.
Supported differentially expressed gene (DEG) output to the
.csv
file.
Version 0.2.2#
0.2.2 : 2021-11-17#
Optimized the performance of finding marker genes.
Added Cython setup_build function and optimized IO performance of GEF.
Added hotspot pipeline for spatial data and Squidpy for spatial_neighbor function.
Added polygon selection for interactive scatter plot and simplify the visualization part of the code.
Version 0.2.1#
0.2.1 : 2021-10-15#
Fixed the bug of marker_genes_heatmap IndexError and sorted the text of heatmap plot.
Inverted yaxis on the top for spatial_scatter and cluster_scatter plot funcs.
Solved the problem that multiple results of sctransform run were inconsistent.
Updated requirements.txt.
Version 0.2.0#
0.2.0 : 2021-09-16#
Stereopy provides the analysis process based on spatial omics, including reading, preprocessing, clustering, differential expression testing and visualization, etc. There are the updates we made in this version.
We proposed StereoExpData, which is a data format specially adapted to spatial omics analysis.
Supported reading the GEF file, which is faster than reading GEM file.
Supported the conversion between StereoExpData and AnnData.
Added the interactive visualization function for selected data, you can dynamically select the area of interest, and then perform the next step of analysis.
Supported dynamically displaying clustering scatter plots, you can modify the color and point size.
Updated clustering related methods, such as leiden, louvain, which are comparable to the original algorithms.
Added some analysis, such as the method of logres for find marker genes, highly variable genes analysis, sctransform method of normalization like Seruat.
0.1.0 : 2021-05-30#
Initial release