feat: Add RunPipeline tool #253

vm-mishchenko · 2025-05-14T18:45:51Z

Add a new RunPipeline tool that can execute an aggregation pipeline without requiring an Atlas account, cluster, or collection.

The tool accepts a set of documents, an aggregation pipeline, and a search index definition, and runs them against the Search Playground. The Search Playground internally creates an ephemeral collection and executes the pipeline in a temporary environment.
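As a rough illustration of the three inputs, a call might carry something like the object below. This is a sketch, not the tool's actual schema: the `RunPipelineInput` interface is a hypothetical name, the object shape of `searchIndexDefinition` is an assumption, and all values are made up.

```typescript
// Field names follow the arguments discussed in this PR; the interface name,
// the shape of searchIndexDefinition, and every value here are assumptions.
type Doc = Record<string, unknown>;

interface RunPipelineInput {
    documents: Doc[];
    aggregationPipeline: Doc[];
    searchIndexDefinition: Doc;
}

const exampleInput: RunPipelineInput = {
    // Documents the Playground would load into its ephemeral collection.
    documents: [
        { _id: 1, title: "Intro to aggregation" },
        { _id: 2, title: "Full-text search basics" },
    ],
    // A single $search stage that matches on the title field.
    aggregationPipeline: [
        { $search: { text: { query: "search", path: "title" } } },
    ],
    // Dynamic mappings index every supported field in the documents.
    searchIndexDefinition: { mappings: { dynamic: true } },
};
```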

Tested manually and with an integration test. More tests will be added in follow-up PRs.

What would be the result of the query?

Is the query syntax correct?

Why doesn't $search return the first doc?

edgarw19 · 2025-05-14T19:35:35Z

src/tools/playground/runPipeline.ts

+export const RunPipelineOperationArgs = {
+    documents: z
+        .array(z.record(z.string(), z.unknown()))
+        .describe("Documents to run the pipeline against. 500 is maximum.")


nit: Worth adding .max(500) to codify this?

nice idea, added
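For reference, the limit being codified amounts to the check below. This is a dependency-free sketch rather than the PR's code: in the zod schema the equivalent is chaining `.max(500)` on the array, and `MAX_DOCUMENTS` and `validateDocuments` are hypothetical names.

```typescript
// Hypothetical names for illustration; the PR expresses this limit in its zod schema.
const MAX_DOCUMENTS = 500;

type Doc = Record<string, unknown>;

// Returns the documents unchanged, or throws when more than the documented
// maximum of 500 are supplied.
function validateDocuments(documents: Doc[]): Doc[] {
    if (documents.length > MAX_DOCUMENTS) {
        throw new Error(
            `Expected at most ${MAX_DOCUMENTS} documents, received ${documents.length}.`
        );
    }
    return documents;
}
```

Enforcing the limit in the schema means an oversized input is rejected before any request reaches the Playground.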

edgarw19 · 2025-05-14T19:37:41Z

src/tools/playground/runPipeline.ts

+export class RunPipeline extends ToolBase {
+    protected name = "run-pipeline";
+    protected description =
+        "Run aggregation pipeline for provided documents without needing an Atlas account, cluster, or collection.";


I wonder if it would help to give the LLM some context in the description about when to use this tool, for example "can be useful in cases such as x, y, z", since the use cases seem more open-ended than for the other tools.

Added a small clause: The tool can be useful for running ad-hoc pipelines for testing or debugging.

I agree, it's quite an open-ended tool, so I would leave it to the LLM to decide when exactly to use it.

edgarw19 · 2025-05-14T19:48:36Z

src/tools/playground/runPipeline.ts

+        .array(z.record(z.string(), z.unknown()))
+        .describe("Aggregation pipeline to run on the provided documents.")
+        .default(DEFAULT_PIPELINE),
+    searchIndexDefinition: z


Are more specific types for aggregationPipeline/searchIndexDefinition/synonyms useful for the LLM or is it already pretty good at determining the types from the description?

For example, the Search Playground looks limited to a subset of aggregation pipeline stages. Would it be helpful to include those in the type?

I feel it would be hard to add more specific zod types here. Unfortunately, all these entities have a complex, dynamic structure.

I changed "Aggregation pipeline..." to "MongoDB aggregation pipeline..." (and did the same for the other fields) to stress the MongoDB part, which hopefully nudges the LLM in the right direction.

Regarding supported stages, I'd avoid listing them here. If we hardcode them, the list will likely drift out of sync between the Playground and MCP over time. I'd rather rely on the Playground's response to flag any unsupported stages. It actually supports more than what's in the public docs (product wants to position it as a Search-only playground for now).
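To make the "rely on the Playground's response" idea concrete, here is a minimal sketch. The response shape is an assumption (the real Playground API is not shown in this thread), and `PlaygroundResult` and `toToolText` are hypothetical names.

```typescript
// Hypothetical response shape; the real Playground API may differ.
type PlaygroundResult =
    | { ok: true; documents: Record<string, unknown>[] }
    | { ok: false; errorMessage: string };

// Forwards the Playground's own error text instead of checking the pipeline
// against a hardcoded list of supported stages.
function toToolText(result: PlaygroundResult): string {
    if (!result.ok) {
        return `Playground rejected the pipeline: ${result.errorMessage}`;
    }
    return JSON.stringify(result.documents);
}
```

With this approach the tool has no stage list to maintain, so it cannot fall out of sync with what the Playground actually accepts.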

edgarw19

lgtm!

Add RunPipeline tool

256587a

vm-mishchenko force-pushed the add-run-playground-tool branch from 6bee4bc to 256587a May 14, 2025 18:55

vm-mishchenko changed the base branch from main to search-skunkworks-2025 May 14, 2025 18:55

vm-mishchenko marked this pull request as ready for review May 14, 2025 19:11

vm-mishchenko requested a review from a team as a code owner May 14, 2025 19:11

edgarw19 reviewed May 14, 2025

Address comments

3df5eca

vm-mishchenko force-pushed the add-run-playground-tool branch from 2f25ada to 3df5eca May 15, 2025 15:35

vm-mishchenko requested a review from edgarw19 May 15, 2025 15:53

edgarw19 approved these changes May 15, 2025

mihirmpatil approved these changes May 15, 2025

vm-mishchenko merged commit 298adf4 into search-skunkworks-2025 May 15, 2025
9 checks passed

vm-mishchenko deleted the add-run-playground-tool branch May 15, 2025 16:34
