
In this use case, we reproduce part of the analysis from the study “HOX13-dependent chromatin accessibility underlies the transition towards the digit development program” (Desanlis et al., 2020, Nature Communications) to demonstrate how Drylab can replicate published computational workflows.

Objective:
The transition from proximal to distal limb identity and the formation of digits require precise chromatin remodeling.
HOX13 proteins (HOXA13 and HOXD13) are known to be essential for digit development, but their exact role at the chromatin level remained incompletely understood.
Methods Summary
Data source: GSE145657 — two
.snapfiles (GSM4323464, KO; GSM4323465, WT), extracted from a 2.5 GB tarball (Desanlis et al., Nat Commun (2020)) to provide a comprehensive, reproducible view of HOX13-dependent chromatin accessibility changes in the developing mouse forelimb bud.QC: Three-threshold evaluation (lenient / standard / strict) on log10(total fragments), TSS-region signal (SE), and FRIP (PL/PP). Standard threshold selected.
Dimensionality reduction: Feature selection (top 50,000 variable bins), spectral embedding (50 components, SnapATAC2
tl.spectral), UMAP at k = 15, 30, 50.Clustering: Leiden algorithm grid search over k in {15, 30, 50} and resolution in {0.2, 0.4, 0.6, 0.8, 1.0, 1.5} (18 combinations). Stability assessed via silhouette score and 5-fold 80% subsampling ARI.
Statistical testing: Global chi-squared test; per-cluster two-sided Fisher's exact test with Benjamini-Hochberg FDR correction; log2 odds ratio with 1,000-iteration bootstrap 95% CI.
Key Findings
QC Filtering
Threshold | Cells retained |
|---|---|
Lenient (log10TN >= 2.5, SE >= 50, FRIP >= 0.60) | 10,072 (76.5%) |
Standard (log10TN >= 3.0, SE >= 100, FRIP >= 0.70) | 9,437 (71.6%) |
Strict (log10TN >= 3.5, SE >= 200, FRIP >= 0.75) | 9,019 (68.5%) |
The standard threshold was selected, retaining 5,581 WT and 3,856 KO cells with median FRIP approximately 0.92, indicating high library quality.
Dimensionality Reduction
Spectral embedding yielded 46 non-trivial components; elbow detection placed the dominant biological signal at component 5.
UMAP topology was stable across k = 15, 30, 50, confirming robustness of the manifold structure.
Clustering Stability Grid Search
Parameter set | n_clusters | Silhouette | Subsampling ARI |
|---|---|---|---|
k=15, res=1.0 | 19 | 0.136 | 0.724 +/- 0.059 |
k=30, res=1.0 | 17 | 0.144 | 0.776 +/- 0.042 |
k=50, res=0.6 | 12 | 0.172 | 0.802 +/- 0.074 |
The optimal parameters (k = 50, resolution = 0.6) achieved the highest silhouette score and subsampling ARI simultaneously, yielding 12 clusters. [,]
WT vs KO Distribution
Global chi-squared test: statistic = 2,631.6, df = 11, p approximately 0 — highly significant genotype-cluster association.
Per-cluster Fisher's exact test (BH-FDR corrected):
Cluster | n_WT | n_KO | % WT | log2 OR | 95% CI | FDR |
|---|---|---|---|---|---|---|
C4 | 0 | 859 | 0% | -11.64 | (-11.71, -11.58) | 0 |
C1 | 1,645 | 1 | 99.9% | +10.07 | (+8.80, +11.70) | 0 |
C0 | 789 | 963 | 45% | -1.02 | (-1.17, -0.87) | 3.9e-39 |
C5 | 383 | 375 | 50.5% | -0.55 | (-0.77, -0.33) | 1.5e-6 |
C7 | 358 | 98 | 78.5% | +1.39 | (+1.08, +1.74) | 1.5e-18 |
C2 | 772 | 464 | 62.5% | +0.23 | (+0.05, +0.41) | 0.022 |
C8 | 275 | 152 | 64.4% | +0.33 | (+0.05, +0.63) | 0.040 |
C3, C6, C9-C11 | -- | -- | ~58-67% | ~0 | crosses 0 | > 0.24 (NS) |

Interpretation
The scATAC-seq landscape resolves 12 stable chromatin accessibility states in the developing limb bud , consistent with the multiple mesenchymal clusters reported in the original publication. The moderate overall silhouette score (0.172) is expected for scATAC-seq data with continuous developmental transitions, yet the high subsampling ARI (0.802) confirms that these 12 clusters represent reproducible chromatin states rather than noise-driven partitions.
Cluster C4 is the most HOX13-dependent population. C4 contains 859 KO cells and 0 WT cells, representing a completely KO-exclusive chromatin state (log2 OR = -11.64, FDR = 0) [,]. This corresponds to the KO-specific cluster (cluster 3 in the original numbering) described by Desanlis et al., which displayed a proximal-like but distinct chromatin signature that emerges only in the absence of HOX13 Desanlis et al., Nat Commun 2020.
Cluster C1 represents the HOX13-dependent digit program. C1 contains 1,645 WT cells and only 1 KO cell (log2 OR = +10.07, FDR = 0), identifying it as the chromatin state that requires HOX13 for establishment.

This aligns with the original study's distal-specific clusters (clusters 2 and 6) that were enriched for the HOXA13 motif and associated with digit morphogenesis gene ontology terms, and that were absent in Hox13-/- samples.
The extreme bimodal genotype segregation — with KO cells completely excluded from C1 and shunted into C4 — demonstrates that HOX13 loss does not uniformly perturb all cell states but instead prevents cells from accessing the digit-specific chromatin program and diverts them into an aberrant KO-specific configuration . The original paper explicitly noted that "chromatin accessibility did not simply switch to a proximal limb bud profile," consistent with the observation here that C4 is a distinct state rather than a simple reversion.
These data support a model in which HOX13 acts as a master regulator — and likely a pioneer transcription factor — of the digit-specific chromatin accessibility program, with its absence preventing the opening of distal limb-specific loci. The moderately affected clusters (C0: 55% KO, FDR = 3.9e-39; C5: 49.5% KO, FDR = 1.5e-6; C7: 78.5% WT, FDR = 1.5e-18) reflect intermediate progenitor states where HOX13 activity modulates but does not completely determine chromatin accessibility. The KO-exclusive C4 population may represent an alternative, partially proximal-like chromatin configuration that is normally suppressed by HOX13 activity during the critical window of digit morphogenesis.
The cluster numbering in this reanalysis (C1, C4) does not directly correspond to the original study's labels (clusters 2/6 for WT-distal, cluster 3 for KO-specific), as expected given differences in preprocessing parameters, bin size, and clustering resolution. Nonetheless, the biological phenomena — a WT-exclusive digit chromatin state and a KO-exclusive aberrant state — are fully reproduced.
Recommended Next Steps
Differential accessibility analysis: Apply
findDARordiff_testbetween C1 (digit program) and C4 (KO-aberrant) to identify the specific genomic loci whose accessibility is HOX13-dependent.Motif enrichment: Perform motif enrichment analysis on differentially accessible regions to confirm HOX13 motif prevalence at C1-specific peaks and identify compensatory TF motifs enriched in C4.
Integration with scRNA-seq: Link chromatin accessibility changes to gene expression (e.g., Bmp2, Shh pathway genes) via gene activity scores or multi-omic integration.
Pseudotime trajectory: Model the developmental trajectory from shared progenitor states toward the digit program (C1) versus the KO-aberrant state (C4) to identify the chromatin bifurcation point.


