Uploading openparsed docs to Vector DB #25
Unanswered
GedeonIsezerano
asked this question in
Q&A
Replies: 1 comment
-
I'm not familiar with the details of pinecone, but each node has a import openparse
basic_doc_path = "./sample-docs/mobile-home-manual.pdf"
parser = openparse.DocumentParser()
parsed_basic_doc = parser.parse(basic_doc_path)
content_dict = parsed_basic_doc.dict()
for node in content_dict["nodes"]:
node["vector"] = your_function_to_embed_text(node["text"]) We will likely add the vector attribute directly to the node but currently it get's thrown out after the pipeline is finished running. I will update this answer when that gets implemented. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
What would be the best way to upload the parsed data to a vector db such as pinecone? The output from Documentparser provides nodes but not embedding values.
Here is my code
Beta Was this translation helpful? Give feedback.
All reactions