Skip to content

Widen ObserveResult argument type and improve fallbackLocatorMethod argument handling #782

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 5 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 1 commit
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 5 additions & 0 deletions .changeset/petite-poets-kneel.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,5 @@
---
"@browserbasehq/stagehand": minor
---

support non-string arguments in observations
4 changes: 4 additions & 0 deletions evals/evals.config.json
Original file line number Diff line number Diff line change
Expand Up @@ -325,6 +325,10 @@
"name": "radio_btn",
"categories": ["act"]
},
{
"name": "upload_file",
"categories": ["combination"]
},
{
"name": "checkboxes",
"categories": ["act"]
Expand Down
95 changes: 95 additions & 0 deletions evals/tasks/upload_file.ts
Original file line number Diff line number Diff line change
@@ -0,0 +1,95 @@
import { EvalFunction } from "@/types/evals";
import { z } from "zod";

export const upload_file: EvalFunction = async ({
logger,
debugUrl,
sessionUrl,
stagehand,
useTextExtract,
}) => {
try {
await stagehand.page.goto(
"https://browser-tests-alpha.vercel.app/api/upload-test",
);

const observations = await stagehand.page.observe(
"Upload a file to the file input with id 'fileUpload'",
);
if (observations.length !== 1) {
await stagehand.close();
return {
_success: false,
observations,
debugUrl,
sessionUrl,
logs: logger.getLogs(),
};
}

const uploadObservation = observations[0]!;
uploadObservation.arguments = [
{
name: "emoji.png",
mimeType: "image/png",
buffer: Buffer.from(
"",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should we read this from a file? would also help to look at what the image is

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

idt we need to read from a file. If you run the eval you can see what the image looks like

),
},
];

await stagehand.page.act(uploadObservation);

const fileSize = await stagehand.page.extract({
instruction:
"Get the size of the file. It should be the number inside the span with id fileSize.",
schema: z.object({
fileSize: z.number().describe("The size of the file in bytes"),
}),
useTextExtract,
});

await stagehand.close();

if (!fileSize || !fileSize.fileSize) {
return {
_success: false,
logs: logger.getLogs(),
debugUrl,
sessionUrl,
};
}

return {
_success: true,
fileSize,
logs: logger.getLogs(),
debugUrl,
sessionUrl,
};
} catch (error) {
logger.error({
message: `error in upload_file function`,
level: 0,
auxiliary: {
error: {
value: error.message,
type: "string",
},
trace: {
value: error.stack,
type: "string",
},
},
});

await stagehand.close();

return {
_success: false,
logs: logger.getLogs(),
debugUrl,
sessionUrl,
};
}
};
4 changes: 3 additions & 1 deletion lib/handlers/actHandler.ts
Original file line number Diff line number Diff line change
Expand Up @@ -244,7 +244,9 @@ export class StagehandActHandler {
if (actionOrOptions.variables) {
Object.keys(actionOrOptions.variables).forEach((key) => {
element.arguments = element.arguments.map((arg) =>
arg.replace(`%${key}%`, actionOrOptions.variables![key]),
typeof arg === "string"
? arg.replace(`%${key}%`, actionOrOptions.variables![key])
: arg,
);
});
}
Expand Down
14 changes: 9 additions & 5 deletions lib/handlers/handlerUtils/actHandlerUtils.ts
Original file line number Diff line number Diff line change
Expand Up @@ -383,11 +383,15 @@ export async function fallbackLocatorMethod(ctx: MethodHandlerContext) {
});

try {
await (
locator[method as keyof Locator] as unknown as (
...a: string[]
) => Promise<void>
)(...args.map((arg) => arg?.toString() || ""));
const locatorMethod = locator[method as keyof Locator];
if (typeof locatorMethod !== "function") {
throw new PlaywrightCommandException(
`Method ${method} is not supported by locator`,
);
}
await (locatorMethod as unknown as (...a: unknown[]) => Promise<void>)(
...(args as Parameters<typeof locatorMethod>),
);
} catch (e) {
logger({
category: "action",
Expand Down
2 changes: 1 addition & 1 deletion types/stagehand.ts
Original file line number Diff line number Diff line change
Expand Up @@ -152,7 +152,7 @@ export interface ObserveResult {
description: string;
backendNodeId?: number;
method?: string;
arguments?: string[];
arguments?: unknown[];
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

wondering if we should be a bit more explicit here

}

export interface LocalBrowserLaunchOptions {
Expand Down