Why the old checks fail (and what labs quietly dread)
A shared sequencing run on a Friday, 48 capture areas processed, and a 35% hit in failed barcodes—what should we attribute that to? I dive into gene expression dataset quality every week, and I see the same weak links: inconsistent tissue preparation, sloppy barcode demultiplexing, and overreliance on blanket QC thresholds that don’t fit spatial transcriptomics runs (been there in San Diego—March 2023, I logged the times). I write as someone who’s run a core facility for over 15 years; I know the small operational quirks that become big data problems.

Why is this so hard?
I remember a specific case: a Visium slide run at a university core that produced what looked like 80% reads but only 50% usable spots after alignment—no kidding, we lost half the data. That taught me two things fast. First, traditional bulk RNA QC metrics lie when applied to spatial workflows. Second, the pain point isn’t just technology—it’s the human hand off between tissue prep, imaging, and library prep. When we ignore those handoffs, a perfectly good gene expression dataset becomes noisy and weak. I use “single-cell RNA-seq” and “multiplexing” checks now as part of routine triage; they flag problems earlier.
What I do differently — concrete fixes from the trenches
Here’s what I recommend, from running procurement, mentoring technicians, and troubleshooting runs: replace blunt QC pass/fail with layered checks. I build a short checklist that includes tissue integrity scoring, image-to-spot registration verification, and per-spot read distribution (simple, measurable). For instance, adding an imaging verification step before library prep dropped our re-run rate by about 40% at one site. We also log exact lot numbers of reagents; once, swapping a reagent lot reduced background by half (that was in July 2022). These are small, specific actions—no abstract strategy talk—just practical moves that save time and money.
(Side note: we track barcode demultiplexing failures separately now—this exposed a recurring batch effect tied to a particular technician’s handling.) I want labs to stop chasing generic benchmarks and start tracking the things I can change tomorrow: operator variance, imaging alignment, spot-level coverage. Simple. Direct. Actionable.
What’s Next?
Forward-looking, I push labs toward comparative pipelines and reproducible checks. Instead of a single QC script, I run parallel pipelines—one optimized for high-resolution tissue mosaics and another for sparser samples—then compare outputs. Wait—this revealed that some samples assumed to be low-quality were simply ill-processed for the chosen pipeline. And then—by reprocessing with the better-suited pipeline, we reclaimed usable data from 15% of samples that had been marked failures. That’s the sort of measurable gain I chase.
Looking ahead, I expect spatial transcriptomics platforms to provide more transparent intermediate metrics—raw spot-level images, better multiplexing metadata, and clearer error flags so teams can act without guessing. We must demand standard metadata fields across platforms; otherwise a gene expression dataset from Lab A and Lab B will never be comparable. I keep the tone pragmatic here—I’ve coordinated cross-lab comparisons between a Boston biotech and a university core in 2024, and standardized metadata made the difference.

Closing thoughts and how to evaluate options
I’ll be blunt: you’ll waste time if you treat spatial omics like bulk RNA. Evaluate tools by three metrics—(1) spot-level recoverability, (2) transparency of intermediate metrics, and (3) ease of reproducing preprocessing steps. Those metrics reflect real pain points I’ve fixed repeatedly. If you want an honest partner for these checks, I work with platforms that value clear metadata and reproducible pipelines. For straightforward access to curated resources and documentation, check out stomics. Seriously—small process changes add up fast.
