Release Notes#

Version 1.2.0#

1.2.0 : 2024-03-30#

Features:

st.io.read_gem and st.io.read_gef support expression matrix files with geneID information.
Analysis results of find_marker_genes will be saved into the output AnnData h5ad.
Upgraded tissue segmentation algorithm.
Addition of st.tl.adjusted_rand_score to calculate the adjusted Rand coefficient between two clusters.
Addition of st.tl.silhouette_score to calculate the average silhouette coefficient of a cluster.
h5ad2rds.R is compatible with AnnData version > 0.7.5, to convert from h5ad to rds files.
Addition of the clustering category labels to the graph of st.plt.paga_compare.

BUG Fixes:

Fixed the error of high memory consumption when converting X.raw into AnnData.

Version 1.1.0#

1.1.0 : 2024-01-17#

Features:

Reconstructed st.plt.violin visualizing function which is now not only applied to display QC indicators;
ins.export_high_res_area can handle expression matrix and image simultaneously, to lasso region of interest and corresponding sub-image.
Interactive visualizing st.plt.cells_plotting supported displaying expression heatmap and spatial distribution of a single gene.
When input GEF and GEM at cell level, information of DNB count and cell area would be added into cells / obs, and cell border would be added into cells_matrix / obsm.

BUG Fixes:

slideio package removed historical versions, resulting in an installation failure.
Calculating error when performing ms_data.tl.batch_qc, due to abnormal os.getlogin.
st.plt.paga_time_series_plot indicated that the image was too large to draw, due to unprocessed boundary values when computing median.

Version 1.0.0#

1.0.0 : 2023-12-04#

Features:

Addition of GPU acceleration on SinlgeR for large-volume data, and optimized calculating based on CPU version.
Addition of st.plt.elbow to visualize PCA result, for appropriate number of pcs.
Addition of color, max, min setting for colorbar, when plotting heatmap.
Addition of cell segmentation of Deep Learning Model V1_Pro, which is improved based on V1.
Supplemented parameters of st.plt.auc_heatmap and st.plt.auc_heatmap_by_group, full access to seaborn.clustermap;
Addition of thread and seed setting in st.tl.umap, of which the default method have been changed to single thread with the sacrifice of computational efficiency to ensure reproducibility of results. More in https://umap-learn.readthedocs.io/en/latest/reproducibility.html.
Modification of computing method of bin coordinates when reading GEM, consistent with GEF.
Optimized st.io.stereo_to_anndata for efficient format conversion.
Renamed st.tl.spatial_alignment function as st.tl.paste.
export_high_res_area removed parameter cgef.

BUG Fixes:

Occasional square-hollowing area in Deep Learning Model V3 of cell segmentation processing.
st.tl.annotation could not set two or more clusters as a same name.
The data object ins.selected_exp_data obtained from st.plt.interact_spatial_scatter could not be used for subsequent analysis.
Part of data was missing when performed st.plt.interact_spatial_scatter to output high-resolution matrix in GEF format.
Some files met reading error, led by no default setting of bin_type and bin_size in st.io.read_h5ms.
Error in Batch QC calculation due to data type problem.
There is NaN in Cell Community Detection output after threshold filtering, resulting in a calculating error when performed Find marker genes based on it.
st.plt.paga_time_series_plot indicated the image is too large to draw, leading to graph overlap, due to the limitation of matplotlib package.

Version 0.14.0b1 (Beta)#

0.14.0b1 : 2023-9-15#

Notice: this Beta version is specifically developed for multi-sample analysis.

Features:

Addition of Cell Community Detection (CCD) analysis.
Addition of Cell Co-occurrence analysis.
Addition of Cellpose in cell segmentation, especially for cell cytoplasm using model_type='cyto2'.
Addition of circos (st.plt.ccc_circos_plot) and sankey (st.plt.ccc_sankey_plot) plots in Cell-cell Communication analysis.
Addition of volcano (st.plt.TVG_volcano_plot) and tree (st.plt.time_series_tree_plot) plots in Time Series analysis.
Addition of PAGA tree plot, st.plt.paga_plot.
Addition of visuallization of st.tl.dendrogram.
Addition of version check using st.__version__.
Supported obtain subset from a data object, using clustering output, by st.tl.filter_by_clusters.
Supported filtering data using hvgs, by st.tl.filter_by_hvgs.
Supported mapping the clustering result of SquareBin analysis to the same data but in CellBin.
Supported writing annotation information into CellBin GEF file, only clustering result available before.
Supported saving images of PNG and PDF formats, in interactive interface.
Optimized the function of st.tl.find_marker_genes.
Optimized the modification of titles in horizontal axis, vertical axis and plot.

BUG Fixes:

Fixed the issue that SingleR calculating did not add filtration to the column field when traversing expression matrix, resulting in the subsequent absence of the column index.
Fixed the issue that output Seurat h5ad could not be transformed into R format.
Fixed the issue that clustering output of Leiden was in wrong data type under the scene of GPU acceleration, leading to errors in subsequent analysis which work on the clustering result.
Fixed the issue that clustering result could not be written into GEF file, using st.io.update_gef, caused by data type error. From v0.12.1 on, date.cells.cell_name has changed from int to string.

Version 0.13.0b1 (Beta)#

0.13.0b1 : 2023-07-11#

Notice: this Beta version is specifically developed for multi-sample analysis. Major update points are listed below.

Addition of 3D Cell-cell Communication.
Addition of 3D Gene Regulatory Network.
Addition of Trajectory Inference, including PAGA and DPT algorithms.
Addition of Batch QC function for evaluation on batch effect.
Addition of st.io.read_h5ad for improved compatibility with AnnData H5ad, we highly recommend that instead of st.io.read_ann_h5ad.
Addition of analysis workflow tutorial based on multi-sample data, with assistant parameters scope and mode.
Addition of resetting the image order of multi-sample analysis results.
Addition of 3D mesh visualization.
Improved the performance of Gaussian Smoothing.

Version 0.12.1#

0.12.1 : 2023-06-21#

Addition of the pretreatment of calculating quality control metrics at the start of st.tl.filter_genes and st.tl.filter_cells.
Fixed the bug that loaded data from GEF file had the same expression matrix but in different row order, through updating gefpy package to v0.6.24.
Fixed the bug that scale.data had np.nan value in st.tl.sctransform , caused by data type limitation.
Fixed the bug that dot symbol ( ‘.’ ) caused identification error of cluster name in .csv output, when doing st.tl.find_marker_genes.

Version 0.12.0#

0.12.0 : 2023-04-27#

Addition of the algorithm of Cell Segmentation V3.0.
Addition of method='hotspot' to st.tl.regulatory_network_inference, which takes spatial coordinate information into account to calculate the relative importance between TFs and their target genes.
Addition of dpi and width/height setting for visualization, and addition of plotting scale for displaying static plot.
Optimized required memory while plotting UMAP embedding via data.plt.umap and cell distribution via data.plt.cells_plotting.
Fixed bug that input parameter of var_features_n was invalid, in data.tl.scTransform.
Updated requirements.txt.

Version 0.11.0#

0.11.0 : 2023-04-04#

Addition of Cell-cell Communication analysis.
Addition of Gene Regulatory Network analysis.
Addition of SingleR function for automatic annotation.
Addition of v2 algorithm fast cell correction.
Addition of dot plot to display gene-level results.
Addition of the sorting function and the limitation of output genes in data.tl.find_marker_genes.
Added pct and pct_rest to the output files of marker genes.
Addition of the parameter mean_uni_gt in data.tl.filter_genes to filter genes on average expression.
Fixed the bug that adata.X to output AnnData was the raw matrix.
Fixed the failed compatibility to analysis results from .h5ad (version <= 0.9.0).
Updated the tissue segmentation algorithm in the module of cell segmentation to avoid the lack of tissue.
Reconstructed the manual of Stereopy.
Updated requirements.txt.

Version 0.10.0#

0.10.0 :2023-02-22#

Supported installation on Windows.
Addition of displaying basic information of StereoExpData object when simply typing it.
Addition of saving static results plots.
Addition of marker gene proportion (optional), in-group and out-of-group, in data.tl.find_marker_genes. Otherwise, supported filtering marker genes via data.tl.filter_marker_genes.
Supported adapting to AnnData, to directly use data and results stored in AnnData for subsequent analysis.
Addition of the matrix of gene count among clusters so that transformed output .rds file could be used for annotation by SingleR directly.
Initial release of Stereopy development solution.
Updated requirements.txt.

Version 0.9.0#

0.9.0 : 2023-01-10#

Resolved cell boundary overlapping issues during cell correction visualization.
Addition of manually annotating cells and clusters via command lines or interactive visualization features.
Addition of GPU version of UMAP, Neighbors, Leiden, and Louvian.
Updated requirements.txt.

Version 0.8.0#

0.8.0 : 2022-12-02#

Reconstructed scTransform normalization in Stereopy.
Optimized the efficiency of fast-cell-correction.
Enabled to read Seurat output .h5ad file for further analysis.

Version 0.7.0#

0.7.0 : 2022-11-15#

Supported acquiring the cell expression matrix (cellbin) from GEM file.
Updated hotspot to the latest version. Allow to output gene lists for every module.
Allowed to merge and arrange more than two matrices in a row.
Speeded up Stereopy installation and allowed installing heavy frameworks, such as, TensorFlow and PyTorch later before using.
Updated requirements.txt.

Version 0.6.0#

0.6.0 : 2022-09-30#

Added ‘Remove Batch Effect’ algorithm.
Added RNA velocity analysis.
Added export_high_res_area method to export high resolution matrix file(cell bin GEF) after lasso operation.
Updated algorithm of scale.
Optimized the efficiency of cell correction.
Increased multi-chip fusion analysis.
Updated requirements.txt.

Version 0.5.1#

0.5.1 : 2022-09-4#

Fixed bug when using GEM file to run fast-cell-correction algorithm.

Version 0.5.0#

0.5.0 : 2022-09-2#

Added fast-cell-correction algorithm.
Updated gmm-cell-correction algorithm(slower version), and fixed bug that genes in the same position(bin) were assigned to different cells.
Added data.plt.cells_plotting method to display cell details.
Added data.tl.export_high_res_area method to export high resolution matrix file(GEF) after lasso.
Increased tissue_extraction_to_bgef method to extract the tissue area.
Updated algorithm of highly_variable_genes, umap and normalization.
Updated requirements.txt.

Version 0.4.0#

0.4.0 : 2022-07-30#

Updated tissue segmentation algorithm.
Added the n_jobs parameter in st.tl.neighbors and st.tl.phenograph.
Added st.io.read_gef function filtered by the list of gene region.
Updated requirements.txt.

Version 0.3.1#

0.3.1 : 2022-06-30#

Added gaussian smooth function.
Added the svd_solver parameter in data.tl.pca.
Added the output parameter in st.io.write_h5ad.
Updated requirements.txt.

Version 0.3.0#

0.3.0 : 2022-06-10#

Added cell bin correction function.
Added data.tl.scale function in normalization.
Supported writing StereoExpData object into a GEF file.
Fixed bug of scTransform, reading the GEF/GEM file and annh5ad2rds.R.
Updated default cluster groups to start at 1.
Supported writing StereoExpData to stereo .h5ad function.
Updated requirements.txt.

Version 0.2.4#

0.2.4 : 2022-01-19#

Fixed bug of tar package.

Version 0.2.3#

0.2.3 : 2022-01-17#

Added cell segmentation and tissue segmentation function.
Updated stereo_to_anndata function and supported output to .h5ad file.
Added the Rscript supporting h5ad file(with anndata object) to rds file.
Supported differentially expressed gene (DEG) output to the .csv file.

Version 0.2.2#

0.2.2 : 2021-11-17#

Optimized the performance of finding marker genes.
Added Cython setup_build function and optimized IO performance of GEF.
Added hotspot pipeline for spatial data and Squidpy for spatial_neighbor function.
Added polygon selection for interactive scatter plot and simplify the visualization part of the code.

Version 0.2.1#

0.2.1 : 2021-10-15#

Fixed the bug of marker_genes_heatmap IndexError and sorted the text of heatmap plot.
Inverted yaxis on the top for spatial_scatter and cluster_scatter plot funcs.
Solved the problem that multiple results of sctransform run were inconsistent.
Updated requirements.txt.

Version 0.2.0#

0.2.0 : 2021-09-16#

Stereopy provides the analysis process based on spatial omics, including reading, preprocessing, clustering, differential expression testing and visualization, etc. There are the updates we made in this version.

We proposed StereoExpData, which is a data format specially adapted to spatial omics analysis.
Supported reading the GEF file, which is faster than reading GEM file.
Supported the conversion between StereoExpData and AnnData.
Added the interactive visualization function for selected data, you can dynamically select the area of interest, and then perform the next step of analysis.
Supported dynamically displaying clustering scatter plots, you can modify the color and point size.
Updated clustering related methods, such as leiden, louvain, which are comparable to the original algorithms.
Added some analysis, such as the method of logres for find marker genes, highly variable genes analysis, sctransform method of normalization like Seruat.

0.1.0 : 2021-05-30#

Initial release