Bridging Benchling and AWS: How GenAI and MCP Accelerate Scientific Workflows

Kevin Moore January 9, 2026

Nextflow S3 AI Benchling AWS MCP

Watch now: Kevin Moore & Vasisht Tadigotla: Accelerating Cross-Assay Correlation with GenAI

Transform hours of tedious manual data analysis into minutes with connected scientific data and AI

Like many modern biotech companies, Sail Biomedicines performs multiple types of experiments. Their platform, which enables AI-based programming of endless RNA™ (eRNA™), leverages multiple workflows to deeply characterize RNA. But to do so, the team faces the challenge of combining and normalizing results and metadata across multiple data systems. For Sail, two of these systems include Benchling and AWS:

Benchling: The system of record, which tracks sample registration, assays, and experimental context, and is home to low-throughput experimental results (ELISAs, qPCR, Flow Cytometry, etc.)
AWS: The home for high-throughput data sets, including large next-generation DNA sequencing-based results.

Even answering simple questions requires data from both systems. For example, if a scientist wants to know, “Can I use RNA expression levels of eRNA™ constructs from a specific program as a proxy for ranking protein expression?”, they would have to:

Find the relevant datasets in Benchling and AWS
Pull expression data from Nextflow results in Amazon S3
Match the metadata with results in the Benchling warehouse
Collate the data
Perform the analysis and generate a report

This process can take a significant amount of time, adding friction to the design, build, and test cycle of improving eRNA™ performance. When scientists are stuck waiting to answer “Did my experiment work and what should I do next?” innovation slows.

How AI and Model Context Protocol (MCP) Enable Faster Scientific Data Analysis

At the 2025 Nextflow Summit, Vasisht Tadigotla from Sail Biomedicines presented a new, AI-first approach that replaces these time-consuming, manual steps with a single AI prompt. Connecting Anthropic's Claude Sonnet 4.5 to MCP servers from Benchling and Quilt.bio (the scientific data management system for AWS), bench scientists can get answers to their cross-assay questions in minutes instead of hours.

Giving LLMs the Tools They Need for Scientific Data Analysis

Today's LLMs can reason about scientific questions, but their context windows are much smaller than biotech datasets. For an LLM to help scientists analyze data across systems, a single AI session must search and query data from multiple sources, and therefore relies on numerous tools:

Searching and filtering massive datasets.
Querying structured tables (SQL over columnar data).
Joining data across disparate systems (e.g., Bench metadata ↔ Pipeline outputs).
Executing complex computations.
Producing reproducible artifacts (versioned datasets, plots).

That’s where MCP comes in. MCP is a standard that allows Large Language Models (LLMs) to interact with external tools in a structured, auditable way. Instead of each system having its own isolated AI, MCP allows one AI session to securely access multiple systems through standardized tool interfaces. Crucially, this is achieved without the model ingesting all the data. Data stays in its source system, and access control is enforced by that system, which makes MCP servers well-suited to highly regulated, large-scale scientific environments.

For companies – like Sail – that manage large-scale scientific data in AWS, AI requires an extra layer to transform raw S3 (records) into analysis-ready data. Quilt provides this layer. With Quilt, users can store, organize, and retrieve terabyte-scale data effortlessly within Amazon S3 and attach it to Benchling Notebooks via seamless integration. This integration ensures that all data is securely managed, easily accessible, and fully labeled for streamlined workflows, enhancing productivity in the lab.

MCP Tools Needed to Bridge Benchling and AWS

A Concrete Example: RNA–Protein Correlation at Sail Biomedicines

Sail scientists wanted to understand how RNA expression (on S3) for eRNA™ constructs in a specific program correlated with protein abundance in Benchling.

Using Anthropic's Claude Sonnet 4.5 connected to MCP Servers from Quilt and Benchling, Sail scientists ran a single prompt that:

Located relevant Quilt packages in S3
Queried RNA-seq expression tables at scale
Pulled matching protein assay results from Benchling
Aligned samples and gene identifiers
Generated comparative visualizations
Packaged the analysis as a new Quilt package

Total time to analysis: ~10 minutes

No file wrangling.
No hand-written SQL.
No hand-off to a data science team.

With GenAI and MCP bridging their Benchling and AWS systems, Sail’s Bench scientists can now:

Perform meaningful analyses themselves
Quickly see computational results in their experimental context
Iterate faster on hypotheses, accelerating experimentation

Data and platform teams can:

Avoid bespoke integrations
Preserve AWS-native governance
Focus on advanced analysis rather than routine requests

MCP enables AI systems to accelerate scientific workflows that cut across system boundaries. Data analysis that normally requires hours to days of tedious, manual work across bench scientists and computational teams takes minutes with this breakthrough approach. To learn more, watch the video:

Watch now: Kevin Moore & Vasisht Tadigotla: Accelerating Cross-Assay Correlation with GenAI

Bridging Benchling and AWS: How GenAI and MCP Accelerate Scientific Workflows

Transform hours of tedious manual data analysis into minutes with connected scientific data and AI

How AI and Model Context Protocol (MCP) Enable Faster Scientific Data Analysis

Giving LLMs the Tools They Need for Scientific Data Analysis

MCP Tools Needed to Bridge Benchling and AWS

A Concrete Example: RNA–Protein Correlation at Sail Biomedicines

Comments

Related Posts

Quilt + Benchling Integration: Seamless Data Packaging for Life Science Labs

A.I. is Just Compute: Governance for LLMs is easier than you think (with AWS Bedrock examples)

Nextflow and the Metadata Gap: How Quilt Bridges Science and Scale