Bobby Ranjan

About Me

Hi! I'm Bobby Ranjan, a computational biologist at EMBL Heidelberg. I have 7+ years of experience developing and applying single-cell multi-omics methods, combined with hands-on expertise in mouse and molecular biology. I'm passionate about leveraging gene–environment interactions to uncover mechanisms driving disease risk.

Education

Doctor of Philosophy | Computational and Systems Biology

2021 – 2025

European Molecular Biology Laboratory & Heidelberg University, Germany

Bachelor of Engineering in Computer Engineering

2014 - 2018

Minor in Entrepreneurship
Minor in Life Sciences

Nanyang Technological University, Singapore

Selected Publications

Abstract

The paternal preconception environment has been implicated as a modulator of phenotypic traits and disease risk in F1 offspring. However, the prevalence and mechanisms of such intergenerational epigenetic inheritance (IEI) in mammals remain poorly defined. Moreover, the interplay between paternal exposure, genetics, and age on emergent offspring features is unexplored. Here, we measure the quantitative impact of three paternal environments on early embryogenesis across genetic backgrounds. Using in vitro fertilisation (IVF) at scale, we capture batch-robust transcriptomic signatures of IEI with single-blastocyst resolution. Amongst these, paternal gut microbiota dysbiosis is linked with aberrant expression of (extra-)embryonic lineage regulators in blastocysts. In contrast, a paternal low-protein high-sugar diet associates with subtle preimplantation growth effects. We further identify gene expression variability as a paternally induced F1 phenotype, and highlight confounding issues for IEI, such as batch effects and under-sampling. Finally, while genetic background dominantly modifies the inherited signature of paternal environment, aged fathers universally impact F1 expression programmes across genetic contexts. This study systematically characterises how paternal conditioning programs subtle but detectable molecular responses in early embryos, and proposes guiding principles to dissect intergenerational phenomenology.

Abstract

The gut microbiota operates at the interface of host–environment interactions to influence human homoeostasis and metabolic networks. Environmental factors that unbalance gut microbial ecosystems can therefore shape physiological and disease-associated responses across somatic tissues. However, the systemic impact of the gut microbiome on the germline—and consequently on the F1 offspring it gives rise to—is unexplored10. Here we show that the gut microbiota act as a key interface between paternal preconception environment and intergenerational health in mice. Perturbations to the gut microbiota of prospective fathers increase the probability of their offspring presenting with low birth weight, severe growth restriction and premature mortality. Transmission of disease risk occurs via the germline and is provoked by pervasive gut microbiome perturbations, including non-absorbable antibiotics or osmotic laxatives, but is rescued by restoring the paternal microbiota before conception. This effect is linked with a dynamic response to induced dysbiosis in the male reproductive system, including impaired leptin signalling, altered testicular metabolite profiles and remapped small RNA payloads in sperm. As a result, dysbiotic fathers trigger an elevated risk of in utero placental insufficiency, revealing a placental origin of mammalian intergenerational effects. Our study defines a regulatory ‘gut–germline axis’ in males, which is sensitive to environmental exposures and programmes offspring fitness through impacting placenta function.

Abstract

This cross-sectional investigation examined peripheral blood mononuclear cells from 46 manifest type 1 diabetes patients and 31 controls using single-cell transcriptomic analysis. The study identified profound alterations in circulatory immune cells (1784 dysregulated genes in 13 immune cell types), with upregulated genes involved in WNT signaling, interferon response, T/NK cell migration, and monocyte activation. The authors developed a T1DM metagene z-score that distinguished cases from controls and stratified patients into molecular subtypes. This score correlated with established prognostic immune markers and clinical trial drug response patterns. The findings reveal a surprisingly strong systemic dimension at the level of the immune cell network in T1DM, define disease-relevant molecular subtypes, and may facilitate non-invasive testing and patient classification strategies.

Abstract

Feature selection (marker gene selection) is widely believed to improve clustering accuracy, and is thus a key component of single cell clustering pipelines. Existing feature selection methods perform inconsistently across datasets, occasionally even resulting in poorer clustering accuracy than without feature selection. Moreover, existing methods ignore information contained in gene-gene correlations. Here, we introduce DUBStepR (Determining the Underlying Basis using Stepwise Regression), a feature selection algorithm that leverages gene-gene correlations with a novel measure of inhomogeneity in feature space, termed the Density Index (DI). Despite selecting a relatively small number of genes, DUBStepR substantially outperformed existing single-cell feature selection methods across diverse clustering benchmarks. Additionally, DUBStepR was the only method to robustly deconvolve T and NK heterogeneity by identifying disease-associated common and rare cell types and subtypes in PBMCs from rheumatoid arthritis patients. DUBStepR is scalable to over a million cells, and can be straightforwardly applied to other data types such as single-cell ATAC-seq. We propose DUBStepR as a general-purpose feature selection solution for accurately clustering single-cell data.

Abstract

The transcriptomic diversity of cell types in the human body can be analysed in unprecedented detail using single cell (SC) technologies. Unsupervised clustering of SC transcriptomes, which is the default technique for defining cell types, is prone to group cells by technical, rather than biological, variation. Compared to de-novo (unsupervised) clustering, we demonstrate using multiple benchmarks that supervised clustering, which uses reference transcriptomes as a guide, is robust to batch effects and data quality artifacts. Here, we present RCA2, the first algorithm to combine reference projection (batch effect robustness) with graph-based clustering (scalability). In addition, RCA2 provides a user-friendly framework incorporating multiple commonly used downstream analysis modules. RCA2 also provides new reference panels for human and mouse and supports generation of custom panels. Furthermore, RCA2 facilitates cell type-specific QC, which is essential for accurate clustering of data from heterogeneous tissues. We demonstrate the advantages of RCA2 on SC data from human bone marrow, healthy PBMCs and PBMCs from COVID-19 patients. Scalable supervised clustering methods such as RCA2 will facilitate unified analysis of cohort-scale SC datasets.

Availability: RCA2 is implemented in R and is available on GitHub.

Abstract

Background: Clustering is a crucial step in the analysis of single-cell data. Clusters identified in an unsupervised manner are typically annotated to cell types based on differentially expressed genes. In contrast, supervised methods use a reference panel of labelled transcriptomes to guide both clustering and cell type identification. Supervised and unsupervised clustering approaches have their distinct advantages and limitations. Therefore, they can lead to different but often complementary clustering results. Hence, a consensus approach leveraging the merits of both clustering paradigms could result in a more accurate clustering and a more precise cell type annotation.
Results: We present scConsensus, an R framework for generating a consensus clustering by (i) integrating the results from both unsupervised and supervised approaches and (ii) refining the consensus clusters using differentially expressed (DE) genes. The value of our approach is demonstrated on several existing single-cell RNA sequencing datasets, including data from sorted PBMC sub-populations.
Conclusions: scConsensus combines the merits of unsupervised and supervised approaches to partition cells with better cluster separation and homogeneity, thereby increasing our confidence in detecting distinct cell types. scConsensus is freely available on GitHub.

Abstract

This study examined gene expression patterns in over 91,000 individual cells from 29 colorectal cancer patients across Korean and Belgian cohorts. The research found that cancer cells display gene expression patterns resembling normal cellular differentiation while harboring genetic changes that establish immunosuppressive surroundings. The suppressive microenvironment involved regulatory T cells, myofibroblasts, and myeloid cells. Network analysis revealed associations between specific cancer cell signatures and particular stromal and immune populations. The comprehensive cellular landscape mapping and intercellular interaction data offer insights for developing more effective immunotherapy strategies for colorectal cancer treatment.

Abstract

Background: Alzheimer's disease (AD) is a progressive neurological disorder, recognized as the most common cause of dementia affecting people aged 65 and above. AD is characterized by an increase in amyloid metabolism, and by the misfolding and deposition of β-amyloid oligomers in and around neurons in the brain. These processes remodel the calcium signaling mechanism in neurons, leading to cell death via apoptosis. Despite accumulating knowledge about the biological processes underlying AD, mathematical models to date are restricted to depicting only a small portion of the pathology.
Results: Here, we integrated multiple mathematical models to analyze and understand the relationship among amyloid depositions, calcium signaling and mitochondrial permeability transition pore(PTP)-related cell apoptosis in AD. The model was used to simulate calcium dynamics in the absence and presence of AD. In the absence of AD, i.e. without β-amyloid deposition, mitochondrial and cytosolic calcium level remains in the low resting concentration. However, our in silico simulation of the presence of AD with the β-amyloid deposition, shows an increase in the entry of calcium ions into the cell and dysregulation of Ca²⁺ channel receptors on the Endoplasmic Reticulum. This composite model enabled us to make simulation that is not possible to measure experimentally.
Conclusions: Our mathematical model depicting the mechanisms affecting calcium signaling in neurons can help understand AD at the systems level and has potential for diagnostic and therapeutic applications.

Experience

Computational Biologist, Stegle Group

2026 – Present

European Molecular Biology Laboratory (EMBL), Heidelberg, Germany

Research topic: Genome-exposome interface for multi-disease risk prediction

Predoctoral Fellow, Hackett Group

2021 – 2025

European Molecular Biology Laboratory (EMBL), Rome, Italy

Research topic: Intergenerational epigenetic inheritance — mechanisms by which paternal environment shapes offspring phenotype via the germline

Bioinformatics Specialist, Prabhakar Lab

August 2018 – July 2021

Genome Institute of Singapore

Developing algorithms for cell type identification in single-cell data

Software Design Engineer Intern

May - August 2017

BitTitan

Built customer-facing license consumption report for all BitTitan products
Conducted tech feasibility analysis to improve BitTitan’s reporting capacity
Built code analysis tool to clean up database references across codebase

Technology Analyst Intern

August - December 2016

Bank of America, Merrill Lynch (Singapore)

Worked on the payments processing and payments testing development teams
Redesigned database logging using a queueing mechanism with the help of Apache ActiveMQ and Java Spring Framework
Also built an application to help onboard new testers onto the testing platform, using Java, AngularJS and SQL

Bobby Ranjan, PhD

|

Languages

About Me

Education

Doctor of Philosophy | Computational and Systems Biology

Bachelor of Engineering in Computer Engineering

Selected Publications

Embryonic signatures of intergenerational epigenetic inheritance across paternal environments and genetic backgrounds

Abstract

Paternal microbiome perturbations impact offspring fitness

Abstract

Systematic immune cell dysregulation and molecular subtypes revealed by single-cell RNA-seq of subjects with type 1 diabetes

Abstract

DUBStepR is a scalable correlation-based feature selection method for accurately clustering single-cell data

Abstract

RCA2: a scalable supervised clustering algorithm that reduces batch effects in scRNA-seq data

Abstract

scConsensus: combining supervised and unsupervised clustering for cell type annotation in single-cell RNA-seq data

Abstract

Lineage-dependent gene expression programs influence the immune landscape of colorectal cancer

Abstract

Composite Mathematical Modeling of Calcium Signaling behind Neuronal Cell Death in Alzheimer's Disease

Abstract

Experience

Computational Biologist, Stegle Group

Predoctoral Fellow, Hackett Group

Bioinformatics Specialist, Prabhakar Lab

Software Design Engineer Intern

Technology Analyst Intern