Implement models API #344

austin-denoble · 2025-05-09T19:31:27Z

Problem

Inference has new APIs which were made available in 2025-04. These allow for listing and describing available models hosted by Pinecone.

Solution

I took the liberty to refactor the Inference class to better align with other classes used in a similar way such as Index and Pinecone. I broke out all inference actions into individual files / functions: embed.ts and rerank.ts. I think a lot of this was implemented rather quickly initially, so I've done a bunch of cleanup while I was in here adding new methods.
Add new getModel.ts and listModels.ts files/functions. These are called and documented inside Inference.
Rework unit tests for embed and rerank. Basically just standardized to how we do mocking / testing elsewhere. They were a bit awkward.
Add unit tests for getModel, listModels, along with an integration test file.
Update README to add examples working with models.

Type of Change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
This change requires a documentation update
Infrastructure change (CI configs, etc)
Non-code change (docs, etc)
None of the above: (explain here)

Test Plan

CI - external test, unit tests, integration tests

To test this you can pull this branch down, and run the repl locally:

from pinecone-ts-client root:

export PINECONE_API_KEY=<here>
npm run repl
await init()

await client.inference.listModels()
await client.inference.getModel('multilingual-e5-large')

To see the specific tasks where the Asana app for GitHub is being used, see below:
- https://app.asana.com/0/0/1209828518477622

…ods are built, and to better organize code now that we're adding models

…s endpoints, undo the breaking change in the embed function

## Problem Inference has new APIs which were made available in `2025-04`. These allow for listing and describing available models hosted by Pinecone. ## Solution - I took the liberty to refactor the `Inference` class to better align with other classes used in a similar way such as `Index` and `Pinecone`. I broke out all inference actions into individual files / functions: `embed.ts` and `rerank.ts`. I think a lot of this was implemented rather quickly initially, so I've done a bunch of cleanup while I was in here adding new methods. - Add new `getModel.ts` and `listModels.ts` files/functions. These are called and documented inside `Inference`. - Rework unit tests for `embed` and `rerank`. Basically just standardized to how we do mocking / testing elsewhere. They were a bit awkward. - Add unit tests for `getModel`, `listModels`, along with an integration test file. - Update README to add examples working with models. ## Type of Change - [ ] Bug fix (non-breaking change which fixes an issue) - [X] New feature (non-breaking change which adds functionality) - [ ] Breaking change (fix or feature that would cause existing functionality to not work as expected) - [ ] This change requires a documentation update - [ ] Infrastructure change (CI configs, etc) - [ ] Non-code change (docs, etc) - [ ] None of the above: (explain here) ## Test Plan CI - external test, unit tests, integration tests To test this you can pull this branch down, and run the repl locally: from `pinecone-ts-client` root: ``` export PINECONE_API_KEY=<here> npm run repl await init() await client.inference.listModels() await client.inference.getModel('multilingual-e5-large') ``` --- - To see the specific tasks where the Asana app for GitHub is being used, see below: - https://app.asana.com/0/0/1209828518477622

austin-denoble added 7 commits May 9, 2025 15:17

refactor inference folder and Inference class to restructure how meth…

a2d49c6

…ods are built, and to better organize code now that we're adding models

hook up listModels and getModel operations

694b927

add unit tests for get and list models

47c177b

add integration tests for models

7160b96

retry on assistant file delete

d13f4ab

fix listModels test

b749711

regenerate core off of fixed api spec, add doc comments for new model…

84d1681

…s endpoints, undo the breaking change in the embed function

austin-denoble marked this pull request as ready for review May 9, 2025 22:20

update README, tweak doc comment

a16cb9e

austin-denoble merged commit b432d20 into 2025-04 May 9, 2025
26 checks passed

austin-denoble deleted the adenoble/implement-models-api branch May 9, 2025 22:57

austin-denoble mentioned this pull request May 9, 2025

Merge 2025-04 RC Branch #345

Merged

7 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement models API #344

Implement models API #344

Uh oh!

austin-denoble commented May 9, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Implement models API #344

Implement models API #344

Uh oh!

Conversation

austin-denoble commented May 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Type of Change

Test Plan

Uh oh!

Uh oh!

Uh oh!

austin-denoble commented May 9, 2025 •

edited

Loading