Release Notes¶
Version 1.3.0¶
1.3.0 : 2024-05-31¶
Features:
Addition of MSData.tl.st_gears for spatial alignment of multi-sample data.
High Resolution Matrix Export can support both GEF and GEM files.
Addition of parameters min_count and max_count for st.tl.filter_genes.
MSData.integrate is compatible with sparse matrices when MSData.var_type is union.
Addition of MSData.tl.set_scope_and_mode to set scope and mode globally in multi-sample analysis.
Addition of MSData.plt.ms_spatial_scatter to plot a separate spatial scatter plot for each sample in multi-sample data.
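As a rough illustration of what count-based gene filtering does, the sketch below keeps only genes whose total count across cells falls within a range. It is plain Python and does not call Stereopy. The function name filter_genes_by_count, the dict-of-lists layout, and the reading of min_count and max_count as inclusive bounds on a gene's total count are assumptions made for this example, not the documented behavior of st.tl.filter_genes.

```python
from typing import Dict, List, Optional


def filter_genes_by_count(
    counts: Dict[str, List[int]],
    min_count: Optional[int] = None,
    max_count: Optional[int] = None,
) -> Dict[str, List[int]]:
    """Keep genes whose total count lies within [min_count, max_count].

    `counts` maps a gene name to its per-cell counts (hypothetical layout).
    A bound left as None is not applied.
    """
    kept = {}
    for gene, per_cell in counts.items():
        total = sum(per_cell)
        if min_count is not None and total < min_count:
            continue  # too few counts overall
        if max_count is not None and total > max_count:
            continue  # too many counts overall
        kept[gene] = per_cell
    return kept


# Toy data: gene totals are 1, 21 and 150.
toy = {"geneA": [0, 1, 0], "geneB": [5, 7, 9], "geneC": [40, 60, 50]}
print(sorted(filter_genes_by_count(toy, min_count=2, max_count=100)))
```

With these toy values only geneB survives, since geneA falls below the lower bound and geneC exceeds the upper one.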
BUG Fixes:
Fixed the problem that st.io.read_gem was incompatible with GEM files containing geneID.
Fixed the bug of losing part of the metadata when writing StereoExpData / MSData into a Stereo-h5ad or h5ms file.
Fixed the incompatibility problem with AnnData when performing st.tl.sctransform.
Version 1.2.0¶
1.2.0 : 2024-03-30¶
Features:
st.io.read_gem and st.io.read_gef support expression matrix files with geneID information.
Analysis results of find_marker_genes will be saved into the output AnnData h5ad.
Upgraded tissue segmentation algorithm.
Addition of st.tl.adjusted_rand_score to calculate the adjusted Rand coefficient between two clustering results.
Addition of st.tl.silhouette_score to calculate the average silhouette coefficient of a clustering result.
h5ad2rds.R is compatible with AnnData version > 0.7.5, to convert from h5ad to rds files.
Addition of the clustering category labels to the graph of st.plt.paga_compare.
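For readers unfamiliar with the adjusted Rand coefficient mentioned above, here is a minimal pure-Python sketch of the standard formula, computed from the contingency table of two labelings. It is not the code behind st.tl.adjusted_rand_score; the function name adjusted_rand_index is hypothetical, and returning 1.0 for degenerate inputs is a choice made for this example.

```python
from collections import Counter
from math import comb
from typing import Hashable, Sequence


def adjusted_rand_index(
    labels_a: Sequence[Hashable], labels_b: Sequence[Hashable]
) -> float:
    """Adjusted Rand index between two labelings of the same items."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both labelings must cover the same items")

    # Pairs of items that share a cluster in both labelings.
    sum_cells = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    # Pairs of items that share a cluster in each labeling on its own.
    sum_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    sum_b = sum(comb(c, 2) for c in Counter(labels_b).values())

    total_pairs = comb(len(labels_a), 2)
    if total_pairs == 0:
        return 1.0  # fewer than two items: treated as identical (assumption)

    expected = sum_a * sum_b / total_pairs  # agreement expected by chance
    max_index = (sum_a + sum_b) / 2
    if max_index == expected:
        return 1.0  # both labelings are trivial partitions (assumption)
    return (sum_cells - expected) / (max_index - expected)


# Cluster names differ, but the partitions are the same.
print(adjusted_rand_index([0, 0, 1, 1], ["b", "b", "a", "a"]))
```

The score depends only on how items are grouped, not on the label values, which is why it suits comparing clustering results whose cluster numbering is arbitrary.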
BUG Fixes:
Fixed the error of high memory consumption when converting X.raw into AnnData.
Version 1.1.0¶
1.1.0 : 2024-01-17¶
Features:
Reconstructed the st.plt.violin visualizing function, which is no longer limited to displaying QC indicators.
ins.export_high_res_area can handle the expression matrix and image simultaneously, to lasso a region of interest and the corresponding sub-image.
Interactive visualization st.plt.cells_plotting supports displaying the expression heatmap and spatial distribution of a single gene.
When inputting GEF and GEM at cell level, the DNB count and cell area are added into cells/obs, and the cell border is added into cells_matrix/obsm.
BUG Fixes:
The slideio package removed historical versions, resulting in an installation failure.
Calculation error when performing ms_data.tl.batch_qc, due to abnormal os.getlogin.
st.plt.paga_time_series_plot indicated that the image was too large to draw, due to unprocessed boundary values when computing the median.
Version 1.0.0¶
1.0.0 : 2023-12-04¶
Features:
Addition of GPU acceleration on SingleR for large-volume data, and optimized calculation based on the CPU version.
Addition of st.plt.elbow to visualize the PCA result, for choosing an appropriate number of PCs.
Addition of color, max, min setting for colorbar, when plotting heatmap.
Addition of cell segmentation of Deep Learning Model V1_Pro, which is improved based on V1.
Supplemented parameters of st.plt.auc_heatmap and st.plt.auc_heatmap_by_group, with full access to seaborn.clustermap.
Addition of thread and seed setting in st.tl.umap, of which the default method has been changed to single thread, sacrificing computational efficiency to ensure reproducibility of results. More in https://umap-learn.readthedocs.io/en/latest/reproducibility.html.
Modification of the computing method of bin coordinates when reading GEM, consistent with GEF.
Optimized st.io.stereo_to_anndata for efficient format conversion.
Renamed st.tl.spatial_alignment function as st.tl.paste.
export_high_res_area removed parameter cgef.
BUG Fixes:
Occasional square-hollowing area in Deep Learning Model V3 of cell segmentation processing.
st.tl.annotation could not set two or more clusters to the same name.
The data object ins.selected_exp_data obtained from st.plt.interact_spatial_scatter could not be used for subsequent analysis.
Part of the data was missing when performing st.plt.interact_spatial_scatter to output a high-resolution matrix in GEF format.
Some files met a reading error, caused by no default setting of bin_type and bin_size in st.io.read_h5ms.
Error in Batch QC calculation due to a data type problem.
NaN values in the Cell Community Detection output after threshold filtering, resulting in a calculation error when performing Find marker genes based on it.
st.plt.paga_time_series_plot indicated that the image was too large to draw, leading to graph overlap, due to the limitation of the matplotlib package.
Version 0.14.0b1 (Beta)¶
0.14.0b1 : 2023-09-15¶
Notice: this Beta version is specifically developed for multi-sample analysis.
Features:
Addition of Cell Community Detection (CCD) analysis.
Addition of Cell Co-occurrence analysis.
Addition of Cellpose in cell segmentation, especially for cell cytoplasm using model_type='cyto2'.
Addition of circos (st.plt.ccc_circos_plot) and sankey (st.plt.ccc_sankey_plot) plots in Cell-cell Communication analysis.
Addition of volcano (st.plt.TVG_volcano_plot) and tree (st.plt.time_series_tree_plot) plots in Time Series analysis.
Addition of PAGA tree plot, st.plt.paga_plot.
Addition of visualization of st.tl.dendrogram.
Addition of version check using st.__version__.
Supported obtaining a subset from a data object, using clustering output, by st.tl.filter_by_clusters.
Supported filtering data using hvgs, by st.tl.filter_by_hvgs.
Supported mapping the clustering result of SquareBin analysis to the same data but in CellBin.
Supported writing annotation information into CellBin GEF file; only the clustering result was available before.
Supported saving images in PNG and PDF formats, in the interactive interface.
Optimized the function of st.tl.find_marker_genes.
Optimized the modification of titles in horizontal axis, vertical axis and plot.
BUG Fixes:
Fixed the issue that the SingleR calculation did not filter the column field when traversing the expression matrix, resulting in the subsequent absence of the column index.
Fixed the issue that output Seurat h5ad could not be transformed into R format.
Fixed the issue that the clustering output of Leiden was in the wrong data type under GPU acceleration, leading to errors in subsequent analyses that work on the clustering result.
Fixed the issue that the clustering result could not be written into a GEF file using st.io.update_gef, caused by a data type error. From v0.12.1 on, data.cells.cell_name has changed from int to string.
Version 0.13.0b1 (Beta)¶
0.13.0b1 : 2023-07-11¶
Notice: this Beta version is specifically developed for multi-sample analysis. Major update points are listed below.
Addition of 3D Cell-cell Communication.
Addition of 3D Gene Regulatory Network.
Addition of Trajectory Inference, including PAGA and DPT algorithms.
Addition of Batch QC function for evaluation on batch effect.
Addition of st.io.read_h5ad for improved compatibility with AnnData H5ad; we highly recommend it instead of st.io.read_ann_h5ad.
Addition of analysis workflow tutorial based on multi-sample data, with assistant parameters scope and mode.
Addition of resetting the image order of multi-sample analysis results.
Addition of 3D mesh visualization.
Improved the performance of Gaussian Smoothing.
Version 0.12.1¶
0.12.1 : 2023-06-21¶
Addition of the pretreatment of calculating quality control metrics at the start of st.tl.filter_genes and st.tl.filter_cells.
Fixed the bug that data loaded from a GEF file had the same expression matrix but in different row order, by updating the gefpy package to v0.6.24.
Fixed the bug that scale.data had np.nan values in st.tl.sctransform, caused by data type limitation.
Fixed the bug that the dot symbol ( ‘.’ ) caused identification errors of cluster names in .csv output, when running st.tl.find_marker_genes.
Version 0.12.0¶
0.12.0 : 2023-04-27¶
Addition of the algorithm of Cell Segmentation V3.0.
Addition of method='hotspot' to st.tl.regulatory_network_inference, which takes spatial coordinate information into account to calculate the relative importance between TFs and their target genes.
Addition of dpi and width/height setting for visualization, and addition of plotting scale for displaying static plot.
Optimized required memory while plotting UMAP embedding via data.plt.umap and cell distribution via data.plt.cells_plotting.
Fixed bug that input parameter of var_features_n was invalid, in data.tl.scTransform.
Updated requirements.txt.
Version 0.11.0¶
0.11.0 : 2023-04-04¶
Addition of Cell-cell Communication analysis.
Addition of Gene Regulatory Network analysis.
Addition of SingleR function for automatic annotation.
Addition of v2 algorithm for fast cell correction.
Addition of dot plot to display gene-level results.
Addition of the sorting function and the limitation of output genes in data.tl.find_marker_genes.
Added pct and pct_rest to the output files of marker genes.
Addition of the parameter mean_umi_gt in data.tl.filter_genes to filter genes on average expression.
Fixed the bug that adata.X of the output AnnData was the raw matrix.
Fixed the failed compatibility with analysis results from .h5ad (version <= 0.9.0).
Updated the tissue segmentation algorithm in the module of cell segmentation to avoid the lack of tissue.
Reconstructed the manual of Stereopy.
Updated requirements.txt.
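The pct and pct_rest columns listed above can be pictured with a small sketch. It assumes that pct is the fraction of cells inside a cluster that express a gene and pct_rest is the same fraction among all other cells, which is the usual convention for marker-gene tables. That reading, the function name expression_fractions, and the "count > 0" definition of expression are assumptions for illustration only and are not taken from the Stereopy source.

```python
from typing import Sequence, Tuple


def expression_fractions(
    expression: Sequence[float], clusters: Sequence[str], group: str
) -> Tuple[float, float]:
    """Return (pct, pct_rest) for one gene and one cluster.

    `expression` holds the gene's value in each cell and `clusters` holds
    each cell's cluster label (hypothetical inputs, aligned by position).
    """
    in_group = [x for x, c in zip(expression, clusters) if c == group]
    rest = [x for x, c in zip(expression, clusters) if c != group]

    def fraction_expressing(values: Sequence[float]) -> float:
        # A cell counts as expressing the gene when its value is above zero.
        if not values:
            return 0.0
        return sum(1 for v in values if v > 0) / len(values)

    return fraction_expressing(in_group), fraction_expressing(rest)


expr = [3, 0, 2, 0, 0, 1]
labels = ["1", "1", "1", "2", "2", "2"]
pct, pct_rest = expression_fractions(expr, labels, group="1")
print(pct, pct_rest)
```

A gene with a high pct and a low pct_rest is expressed mostly inside the cluster, which is what makes these two fractions useful for judging marker genes.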
Version 0.10.0¶
0.10.0 : 2023-02-22¶
Supported installation on Windows.
Addition of displaying basic information of StereoExpData object when simply typing it.
Addition of saving static results plots.
Addition of marker gene proportion (optional), in-group and out-of-group, in data.tl.find_marker_genes. In addition, supported filtering marker genes via data.tl.filter_marker_genes.
Supported adapting to AnnData, to directly use data and results stored in AnnData for subsequent analysis.
Addition of the matrix of gene count among clusters so that the transformed output .rds file could be used for annotation by SingleR directly.
Initial release of Stereopy development solution.
Updated requirements.txt.
Version 0.9.0¶
0.9.0 : 2023-01-10¶
Resolved cell boundary overlapping issues during cell correction visualization.
Addition of manually annotating cells and clusters via command lines or interactive visualization features.
Addition of GPU versions of UMAP, Neighbors, Leiden, and Louvain.
Updated requirements.txt.
Version 0.8.0¶
0.8.0 : 2022-12-02¶
Reconstructed scTransform normalization in Stereopy.
Optimized the efficiency of fast-cell-correction.
Enabled reading the Seurat output .h5ad file for further analysis.
Version 0.7.0¶
0.7.0 : 2022-11-15¶
Supported acquiring the cell expression matrix (cellbin) from GEM file.
Updated hotspot to the latest version. Allowed outputting gene lists for every module.
Allowed to merge and arrange more than two matrices in a row.
Sped up Stereopy installation by allowing heavy frameworks, such as TensorFlow and PyTorch, to be installed later, before they are first used.
Updated requirements.txt.
Version 0.6.0¶
0.6.0 : 2022-09-30¶
Added ‘Remove Batch Effect’ algorithm.
Added RNA velocity analysis.
Added export_high_res_area method to export high resolution matrix file (cell bin GEF) after lasso operation.
Updated algorithm of scale.
Optimized the efficiency of cell correction.
Added multi-chip fusion analysis.
Updated requirements.txt.
Version 0.5.1¶
0.5.1 : 2022-09-04¶
Fixed bug when using GEM file to run fast-cell-correction algorithm.
Version 0.5.0¶
0.5.0 : 2022-09-02¶
Added fast-cell-correction algorithm.
Updated gmm-cell-correction algorithm (slower version), and fixed bug that genes in the same position (bin) were assigned to different cells.
Added data.plt.cells_plotting method to display cell details.
Added data.tl.export_high_res_area method to export high resolution matrix file (GEF) after lasso.
Added tissue_extraction_to_bgef method to extract the tissue area.
Updated algorithm of highly_variable_genes, umap and normalization.
Updated requirements.txt.
Version 0.4.0¶
0.4.0 : 2022-07-30¶
Updated tissue segmentation algorithm.
Added the n_jobs parameter in st.tl.neighbors and st.tl.phenograph.
Added filtering by a gene list or region in the st.io.read_gef function.
Updated requirements.txt.
Version 0.3.1¶
0.3.1 : 2022-06-30¶
Added gaussian smooth function.
Added the svd_solver parameter in data.tl.pca.
Added the output parameter in st.io.write_h5ad.
Updated requirements.txt.
Version 0.3.0¶
0.3.0 : 2022-06-10¶
Added cell bin correction function.
Added data.tl.scale function in normalization.
Supported writing StereoExpData object into a GEF file.
Fixed bugs in scTransform, in reading GEF/GEM files, and in annh5ad2rds.R.
Updated default cluster groups to start at 1.
Added a function for writing StereoExpData to stereo .h5ad.
Updated requirements.txt.
Version 0.2.4¶
0.2.4 : 2022-01-19¶
Fixed bug of tar package.
Version 0.2.3¶
0.2.3 : 2022-01-17¶
Added cell segmentation and tissue segmentation function.
Updated stereo_to_anndata function and supported output to .h5ad file.
Added the Rscript for converting an h5ad file (with an AnnData object) to an rds file.
Supported differentially expressed gene (DEG) output to the .csv file.
Version 0.2.2¶
0.2.2 : 2021-11-17¶
Optimized the performance of finding marker genes.
Added Cython setup_build function and optimized IO performance of GEF.
Added hotspot pipeline for spatial data and Squidpy for spatial_neighbor function.
Added polygon selection for interactive scatter plot and simplified the visualization part of the code.
Version 0.2.1¶
0.2.1 : 2021-10-15¶
Fixed the bug of marker_genes_heatmap IndexError and sorted the text of heatmap plot.
Inverted the y-axis (origin at the top) for the spatial_scatter and cluster_scatter plot functions.
Solved the problem that results of repeated sctransform runs were inconsistent.
Updated requirements.txt.
Version 0.2.0¶
0.2.0 : 2021-09-16¶
Stereopy provides an analysis workflow for spatial omics, including reading, preprocessing, clustering, differential expression testing, and visualization. These are the updates we made in this version.
We proposed StereoExpData, which is a data format specially adapted to spatial omics analysis.
Supported reading the GEF file, which is faster than reading GEM file.
Supported the conversion between StereoExpData and AnnData.
Added the interactive visualization function for selected data; you can dynamically select the area of interest and then perform the next step of analysis.
Supported dynamically displaying clustering scatter plots; you can modify the color and point size.
Updated clustering related methods, such as leiden and louvain, which are comparable to the original algorithms.
Added some analyses, such as the logreg method for finding marker genes, highly variable genes analysis, and the sctransform normalization method as in Seurat.
0.1.0 : 2021-05-30¶
Initial release