Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain
the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in
Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles
and JavaScript.
Methodology development promotes science advancement. Application of new scientific discoveries to biological processes, organisms, or systems can help to sustainably produce products for healing, feeding, or fueling. This page showcases our recent publications that report methodological or biotechnological advances relevant to biological, biomedical, and agricultural sciences.
Cell-free gene expression (CFE) systems are often constrained by numerous biochemical components required to maintain biocatalytic efficiency. Here, the authors propose a droplet-AI combined approach to perform high-throughput and efficient combinatorial screening of CFE. This work led to simplified CFE systems with improved yield and cost-effectiveness.
Studying cell polarity is often limited by the complexity and variety of image data. Polarity-JaM is a software suite streamlining cell polarity analysis with easy-to-use tools and interfaces demonstrated for endothelial cell behaviour.
Regulatory sequences encode crucial gene expression signals, but their functionality across species remains unclear. Here, the authors introduce DeepCROSS, an AI-aided strategy that utilises large-scale meta-representation and an accurate predictor for the inverse design of regulatory sequences.
Three-dimensional imaging is crucial for biomedical research, yet microscopy faces axial resolution limitations. Here, authors introduce SSAI-3D that adapts training datasets and sparsely finetunes a network to achieve robust results across various biological samples and microscopy systems.
Human brain organoids are plagued by heterogeneity and poor reproducibility, critical parameters for reliable disease modeling and drug testing. Here, the authors report on Hi-Q organoids which solve these limitations and can be cryopreserved in large quantities.
Imaging mass spectrometry has revolutionised spatial metabolomics, but metabolites are annotated for only a fraction of the data. Here, authors show that METASPACE-ML enhances precision, increases throughput, and improves the identification of low-intensity and biologically relevant metabolites.
Obtaining a high-resolution contact map using current 3D genomics technologies can be challenging with small input cell numbers. Here, the authors develop ChromaFold, a deep learning model that predicts cell-type-specific 3D contact maps from single-cell chromatin accessibility data alone.
DNA is a promising medium for data storage but requires efficient sequencing methods for real-time data retrieval. Here, authors introduce Composite Hedges Nanopores (CHN), a nanopore-based codec scheme that improves sequencing efficiency and data accuracy, supporting real-time data retrieval.
Electricity can be used to stimulate the nervous system to treat diseases, and computer models are powerful tools for designing these therapies. Here, authors develop a model of neurons’ responses to electricity that is accurate and thousands of times faster than the current industry standard.
Human iPSC-CMs are invaluable for cardiac disease modeling and regeneration. Here, authors developed an optimized suspension culture protocol to efficiently and reproducibly differentiate hiPSCs into cardiomyocytes and cardiac organoids.
SpaGFT is a spatial omics representation method. It identifies spatially variable genes, enhances gene imputation, detects immunological regions, and characterises variations in secondary lymphoid organs. This method improves predictive power in wide downstream machine-learning tasks.
Analysis of nanopore long-read sequencing is challenged by technical noise, particularly in single cells. Here, authors introduce Isosceles, a toolkit for accurate isoform detection, quantification, and flexible downstream analysis of long-read data at single-cell, pseudo-bulk and bulk resolutions.
Batch effects can limit the usefulness of image-based profiling data. Here, authors benchmark ten popular batch correction techniques on a large Cell Painting dataset, evaluating multiple metrics. They identify Harmony and Seurat RPCA as top methods across diverse complex scenarios.
Comparative transcriptomics of whole brains across species is vital in neuroscience. Here, authors develop a deep learning method, BrainAlign, to align spatial transcriptomics across human and mouse brains. BrainAlign identifies conserved brain regions and uncovers similar patterns for marker genes.
Single-cell sequencing is vital for studying complex diseases but is costly. Here, authors introduce scSemiProfiler, a deep generative learning framework that infers single-cell profiles by combining bulk sequencing with single-cell data from selected samples, offering a cost-effective solution.
Multiplets in single-cell sequencing can obscure true biological findings. Here, authors present COMPOSITE, a model-based multiplet detection framework, which helps prevent multiplet clusters, especially in single-cell multiomics data.
High-performance promoters are needed for gene drives; these are currently lacking in Drosophila melanogaster. Here the authors tested eleven Drosophila melanogaster germline promoters in several configurations and show higher drive conversion efficiency with minimal embryo resistance.
Bioprinting has revitalized tissue regeneration efforts, yet challenges persist due to cell damage during fabrication and mechanical instability of printed scaffolds. Here, the authors develop a mechanical-assisted post-bioprinting strategy for loading cells into hollow scaffolds that effectively repair challenging bone defects.
Single-cell manipulation and processing techniques and improvements in mass spectrometry sensitivity make single-cell proteomic profiling feasible. This study presents a label-free approach for the characterisation of native N-glycans of single mammalian cells and ng-level blood isolates, demonstrating the potential to detect cell surface glycome changes at the single-cell level in health or disease.
Polypharmacology drugs are compounds designed to inhibit multiple protein targets. Here, authors use recent advances in AI to rapidly generate polypharmacology compounds against any pair of protein targets, experimentally validating numerous compounds targeting MEK1 and mTOR.
Metagenomic taxonomic profiling usually relies either on reads or assembled contigs/MAGs. Here, authors present RAT, a tool that integrates taxonomic signals from reads, contigs, and MAGs into one profile with high precision and sensitivity. RAT provides a comprehensive view of the microbiome.
Ultrasound localisation microscopy enables deep tissue microvascular imaging. Here, authors introduce LOCA-ULM, a deep learning pipeline enhancing localisation accuracy in high microbubble concentrations. LOCA-ULM reveals dense cerebrovascular networks and enhances the sensitivity of functional ULM.
While tricarboxylic acid cycle (TCA cycle) is required for heterotrophic microbes, it reduces carbon yield of industrial products due to the release of excess CO2. Here, the authors construct an E. coli strain without a functional TCA cycle and demonstrate its feasibility as a chassis strain for production of four separate compounds.
The unification of decision-making, communication, and memory would enable the programming of intelligent biotic systems. Here, the authors achieve this goal by engineering E. coli chassis cells with an array of inducible recombinases that mediate diverse genetic programs.
Extracellular microinterfaces provide cells with migration tracks in vivo. Here, the authors introduce these microtracks into bicontinuous hydrogels to elicit rapid cell migration in 3-dimensional contexts.
RNA splicing serves as a critical layer of gene expression regulation. Here, authors introduce SCASL for investigating the heterogeneity of RNA splicing landscapes at single-cell resolution, offering a novel scheme for classifying cell identities with physiological relevance.
2D visualisation of single-cell data is highly impacted by the hyperparameter setting of the 2D embedding method, such as t-SNE and UMAP. Here, authors develop a statistical method scDEED to detect dubious cell embeddings and optimise the hyperparameter setting for trustworthy visualisation.
Selecting omic biomarkers using both their effect size and their differential status significance (i.e., selecting the “volcano-plot outer spray”) has long been equally biologically relevant and statistically troublesome. However, recent proposals are paving the way to resolving this dilemma.
Fungi have the potential to produce sustainable foods for a growing population, but current products are based on a small number of strains with inherent limitations. Here, the authors develop genetic tools for an edible fungus and engineer its nutritional value and sensory appeal for alternative meat applications.
Machine learning applied to large compendia of transcriptomic data has enabled the decomposition of bacterial transcriptomes to identify independently modulated sets of genes. Here the authors present iModulon-based engineering for precise identification of genes for cross-species function transfer to streamline synthetic biology for strain development and biomanufacturing.
Proteomics at the organelle contact site remains challenging due to the spatial and temporal dynamics of proteins. Here, the authors developed OrthoID, a mutually orthogonal dual enzymatic proteomics approach to explore the proteome at the contact site of the endoplasmic reticulum and mitochondria.
Vaccines combat global influenza threats, relying on timely selection of optimal seed viruses. Here, authors introduce MAIVeSS, a machine learning assisted framework to streamline vaccine seed virus selection using genomic sequence, expediting seasonal flu vaccine production and supply.
Batch effects hinder multi-sample single-cell data analyses. Here, authors present STACAS, a scalable single-cell RNA-seq data integration tool that uses prior cell type knowledge to preserve biological variability, demonstrating robustness to noisy input cell type labels.
Typical single-cell RNAseq pipelines will subcluster homogeneous cells. Here, authors present a computational algorithm for accurately identifying cell-type marker genes in single-cell data analysis with a low false discovery rate.
No consensus exists on the computationally tractable use of dynamic models for strain design. To tackle this, the authors report a framework, nonlinear-dynamic-model-assisted rational metabolic engineering design, for efficiently designing robust, artificially engineered cellular organisms.
Synthetic microbial communities are suitable for mixed substrates fermentation and long metabolic pathway engineering. Here, the authors combine fermentation experiments with mathematical modeling to reveal the effect of compositional and temporal changes on division of labor in cellulosic ethanol production using two yeast strains.
Achieving genetic circuits on single DNA molecules could have varied applications. Here, authors observed proteins emerging from single DNA molecules through coupled transcription-translation complexes, and show that nascent proteins lingered on DNA, regulating cascaded reactions on the same DNA and allowing the design of a pulsatile genetic circuit.
Utilising geometric information and reducing computational costs are key challenges in the molecular modelling field. Here, authors propose ViSNet, which efficiently extracts geometric features, accurately predicts molecular properties, and drives simulations with interpretability.
Subcellular in situ spatial transcriptomics offers the promise to address biological problems that were previously inaccessible but requires accurate cell segmentation to uncover insights. Here, authors present BIDCell, a biologically informed, deep learning-based cell segmentation framework.
Identifying rare cell populations is key to understanding cancer progression and response to therapy. Here, authors introduce MarsGT, an end-to-end deep learning model for rare cell population identification from scMulti-omics data.
Derivation of human primordial germ cell-like cells (hPGCLCs) is critical for reproductive medicine. Here, authors report the induction of hPGCLCs in a bioengineered human pluripotent stem cell culture that mimics peri-implantation human development.
Somatic cloning of rhesus monkey has not been successful until now. Here, authors report epigenetic abnormalities in SCNT embryos and placentas and develop a trophoblast replacement method that enables them to successful clone of a healthy male rhesus monkey.
Reproducibility is essential for the progress of research, yet achieving it remains elusive even in computational fields. Here, authors develop the rworkflows suite, making robust CI/CD workflows easy and freely accessible to all R package developers.
Copy number variants (CNV) are shown to contribute to the etiology of various genetic disorders. Here, authors present ECOLE, a deep learning-based somatic and germline CNV caller for WES data. Utilising a variant of the transformer architecture, the model is trained to call CNVs per exon.
Batch integration is a critical yet challenging step in many single-cell RNA-seq analysis workflows. Here, authors present JOINTLY, a hybrid linear and non-linear NMF-based algorithm, providing interpretable and robust cell clustering against over-integration.
Deciphering the roles of gene regulation in cell fate decisions is crucial. Here, authors present CEFCON, a network-based framework that reveals cell-lineage-specific gene regulatory networks and identifies driver regulators controlling cell fate decisions from single-cell transcriptomics data.
Spatial transcriptomics (ST) enables gene expression characterisation within tissue sections, but comparing across sections and technologies remains challenging. Here, authors develop STalign to spatially align ST data and demonstrate applications including aligning to common coordinate frameworks.
Cultured meat technology promises to alleviate protein shortages, but still faces many challenges. Here, the authors achieve serum-free myogenic differentiation of porcine pre-gastrulation epiblast stem cells and generate meat-like tissue via edible plant-based scaffolds without any animal compounds.