Academic Jobs Logo

NUS CellScope: Breakthrough Single-Cell Atlas Workflow Using Manifold Fitting in Nature Communications

Revolutionizing Cellular Hierarchies with Tree-Structured Atlases

Be the first to comment on this article!

You

Please keep comments respectful and on-topic.

a close up of a white substance on a blue background
Photo by National Institute of Allergy and Infectious Diseases on Unsplash

Promote Your Research… Share it Worldwide

Have a story or a research paper to share? Become a contributor and publish your work on AcademicJobs.com.

Submit your Research - Make it Global News

Revolutionizing Single-Cell Research: NUS CellScope Delivers Unprecedented Resolution

Researchers at the National University of Singapore (NUS) have introduced CellScope, a groundbreaking framework that transforms how scientists construct high-resolution cell atlases from single-cell RNA sequencing (scRNA-seq) data. Published in the prestigious journal Nature Communications on December 30, 2025, this innovation addresses longstanding challenges in analyzing cellular heterogeneity. Led by Associate Professor Zhigang Yao from NUS's Department of Statistics and Data Science, the tool promises to accelerate discoveries in biology and medicine by enabling multi-level exploration of cell types, subtypes, and lineages.

Single-cell RNA sequencing has revolutionized biology by providing gene expression profiles for thousands to millions of individual cells, revealing diversity within tissues that bulk methods obscure. However, noise from technical artifacts, dropout events, and irrelevant housekeeping genes often hampers accurate clustering and visualization. CellScope tackles these issues head-on, offering superior performance over industry standards like Seurat and Scanpy.

Understanding the Challenges in scRNA-seq Data Analysis

scRNA-seq data resides in high-dimensional space where true biological signals lie on a low-dimensional manifold—a smoother, intrinsic structure capturing cell states. Yet, observed data is distorted by two noise types: (1) housekeeping genes constant across cells, inflating noise space, and (2) technical noise like mRNA loss in signal space. Traditional pipelines struggle with scalability on large datasets (>50,000 cells), rare cell detection (<5% abundance), and hierarchical relationships, leading to suboptimal clustering (Adjusted Rand Index or ARI around 0.65-0.68).

In Singapore's vibrant biotech ecosystem, NUS researchers recognized these gaps, building on prior work like scAMF (published in PNAS, 2024), which first applied manifold fitting for denoising. CellScope extends this into a complete workflow for atlas construction.

The Core Innovation: Two-Stage Manifold Fitting in CellScope

CellScope's power stems from its two-stage manifold fitting process, mathematically grounded in density-distance metrics to separate signal from noise.

  • Stage 1: Signal Gene Selection Identifies "manifold seeds"—high-density, distant cells in PCA space—using local density (ρ) and relative distance (δ), combined into γ score. "Highly reliable cliques" form from seed neighbors. ANOVA selects top 500 genes showing low intra-clique variance but high inter-clique differences, filtering housekeeping genes.
  • Stage 2: Technical Noise Reduction Projects low-density outliers (bottom 5%) onto high-density submanifolds via estimation (t=0.9), refining positions to emphasize biology.

Post-fitting, UMAP builds a neighborhood graph (Gaussian kernel), enabling agglomerative clustering with average linkage. This yields a tree-structured representation, visualizing hierarchies from broad lineages to fine subtypes.

Diagram of CellScope two-stage manifold fitting process for scRNA-seq data

Benchmark Superiority: Outperforming Seurat and Scanpy Across 36 Datasets

Tested on 36 diverse scRNA-seq datasets (90 to 265,767 cells, human/mouse brain, pancreas, PBMC, etc.), CellScope achieved average ARI of 0.88 ± 0.014—topping Seurat (0.65) and Scanpy (0.68) on 32/36 sets (p < 10⁻⁵). It excelled in accuracy (ACC), Normalized Mutual Information (NMI), Jaccard Index (JI), and Silhouette Scores, especially for rare cells.

Runtime was fastest or competitive, scaling to massive data where others faltered. Gene selection beat Disp, VST, HVG via ASW and purity metrics. Hybrids (CellScope genes + other clustering) confirmed its edge.Read the full benchmarks in Nature Communications.

aerial view of city buildings during daytime

Photo by Bing Hui Yau on Unsplash

Tree-Structured Visualization: Unlocking Multi-Level Biological Insights

CellScope's hallmark is its intuitive tree: UMAP embeds fitted data, with hierarchical clustering overlaid, showing clusters merging upward. Genes gain dynamic identities—housekeeping (stable), moderately/strongly cell-type-related—via Wasserstein distances between siblings.

In human midbrain (Siletti-1 dataset), it resolved Oligodendrocyte subtypes (OL1/OL2) with markers like RBFOX1 (myelination regulator), unseen by others. Sankey diagrams traced transitions, e.g., from progenitors to mature cells. In COVID-19 PBMC, it separated monocyte-dendritic subsets, revealing severity-linked markers (IFIT1/OAS2) for antivirals.

Practical Advantages: Scalable, Parameter-Light, and Interpretable

Unlike hyperparameter-heavy tools, CellScope is robust across PCA dims, gene counts, seeds (ARI stable). It runs on laptops for <50k cells, subsets for larger. Open-source Python package on GitHub includes tutorials for clustering, trees, markers.

For researchers, this means faster, clearer atlases without tuning—ideal for brain, cancer, immunology studies.

NUS's Growing Leadership in Single-Cell Bioinformatics

Under Prof. Yao's group, NUS bridges stats and biology. Preceding scAMF (PNAS 2024) pioneered manifold denoising; CellScope builds hierarchical atlases. Singapore's NUS ranks top globally (QS 2026: 8th), fueling biotech via RIE2030 investments.Explore Singapore higher ed opportunities.

NUS hosts Single-Cell Res/volution events, contributes to Asian immune atlases—positioning it as Asia's hub for computational biology.

Real-World Applications: From Neuroscience to Infectious Diseases

CellScope unveils hierarchies in development (embryos), aging (Alzheimer's brain), cancer (tumor microenvironments). In COVID, it pinpointed immune shifts, aiding vaccine design. Future: personalized medicine via patient atlases, rare disease subtype discovery.

Tree-structured visualization from CellScope on human midbrain dataset

Stakeholders praise its interpretability: "Reveals insights others miss," per benchmarks.

Career Opportunities in Singapore's Booming Biotech Sector

This breakthrough highlights demand for bioinformaticians. NUS trains via stats-data science programs; Singapore offers research jobs at A*STAR, CSI. Aspiring pros can leverage tools like CellScope for theses, publications.

Check Rate My Professor for NUS faculty insights; explore academic CV tips.

Future Outlook: Transforming Global Cell Atlas Initiatives

CellScope integrates with Human Cell Atlas, accelerating multi-omics. Yao group eyes multimodal (spatial+RNA) extensions. For Singapore, it bolsters biomedical sovereignty amid R&D surge (S$37B RIE2030).

Download code, try tutorials—join the revolution in single-cell analysis. Opportunities abound in university jobs and higher ed careers.

Portrait of Dr. Elena Ramirez

Dr. Elena RamirezView full profile

Contributing Writer

Advancing higher education excellence through expert policy reforms and equity initiatives.

Acknowledgements:

Discussion

Sort by:

Be the first to comment on this article!

You

Please keep comments respectful and on-topic.

New0 comments

Join the conversation!

Add your comments now!

Have your say

Engagement level

Browse by Faculty

Browse by Subject

Frequently Asked Questions

🔬What is CellScope from NUS?

CellScope is an open-source framework for building multi-level cell atlases from scRNA-seq data, using two-stage manifold fitting for noise reduction and hierarchical clustering.

📊How does manifold fitting work in CellScope?

Stage 1 selects signal genes via seeds and cliques + ANOVA; Stage 2 projects outliers to denoised manifolds, enabling accurate cell relationships.

🏆What are CellScope's performance advantages?

ARI 0.88 vs. Seurat's 0.65 on 36 datasets; faster, scalable, better rare cell detection. See paper.

🧬What biological insights does CellScope reveal?

Resolves Oligo subtypes in brain atlases; COVID immune markers like IFIT1. Enables disease subtyping for precision medicine.

💻How to install and use CellScope?

Python package via GitHub: repo. Tutorials for clustering, trees. Docs: cellscope.readthedocs.io.

👥Who developed CellScope?

Led by Assoc. Prof. Zhigang Yao (NUS Statistics), with Bingjie Li et al. Funded by Singapore MOE grants.

🔗Relation to scAMF?

scAMF (PNAS 2024) introduced manifold fitting; CellScope adds tree structures for full atlases.

💉Applications in medicine?

Cancer subtyping, neurodegeneration, immunology. Potential for personalized therapies via patient atlases.

🇸🇬Impact on Singapore biotech?

Boosts NUS's global rank (QS 8th); aligns with RIE2030 for biomed hub. Singapore uni jobs.

🚀Future developments for CellScope?

Multimodal integration (spatial+RNA), larger atlases. Yao group eyes extensions for Human Cell Atlas.

⚖️Compare CellScope to Seurat?

CellScope: higher ARI, multi-level trees, less params. Seurat strong but noise-sensitive on rares.