Multiple Branches #370

Merged
merged 17 commits into from
Jun 11, 2024

@PeriniM PeriniM commented Jun 11, 2024

Embeddings Cache

The graph configuration now accepts a cache_path parameter that caches the vector store indexes for a specific source. The embeddings then no longer have to be recreated every time we scrape something different from the same source.
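To make the idea concrete, here is a minimal sketch of the load-or-build pattern that a cache_path setting enables. It is not the library's code: `load_or_build_index`, `fake_build_index`, the SHA-256 file naming and the JSON storage are all hypothetical stand-ins for the real vector store.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def load_or_build_index(source, cache_path, build_index):
    """Return the cached index for `source`, building and saving it on a miss."""
    cache_dir = Path(cache_path)
    cache_dir.mkdir(parents=True, exist_ok=True)

    # Hypothetical naming scheme: one file per source, keyed by a hash of the URL.
    key = hashlib.sha256(source.encode("utf-8")).hexdigest()
    cache_file = cache_dir / f"{key}.json"

    if cache_file.exists():
        # Cache hit: skip the expensive embedding step entirely.
        return json.loads(cache_file.read_text(encoding="utf-8"))

    # Cache miss: build the index once and persist it for later runs.
    index = build_index(source)
    cache_file.write_text(json.dumps(index), encoding="utf-8")
    return index


calls = []


def fake_build_index(source):
    """Stand-in for the embedding step; records each call so reuse is visible."""
    calls.append(source)
    return {"source": source, "vectors": [[0.0, 1.0]]}


with tempfile.TemporaryDirectory() as tmp:
    first = load_or_build_index("https://perinim.github.io/", tmp, fake_build_index)
    second = load_or_build_index("https://perinim.github.io/", tmp, fake_build_index)
```

The second call returns the stored index without invoking the builder again, which is the saving described above when several prompts target the same source.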

Semchunk

Replaced the langchain text splitter with semchunk.

Added ScriptCreatorMultiGraph

New graph that creates a Python script for scraping multiple URLs.

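from scrapegraphai.graphs import ScriptCreatorMultiGraph

# graph_config is the usual graph configuration dict, defined elsewhere
script_creator_graph = ScriptCreatorMultiGraph(
    prompt="Who is Marco Perini?",
    source=[
        "https://perinim.github.io/",
        "https://perinim.github.io/cv/"
    ],
    config=graph_config
)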

It is also possible to pass a pydantic schema.
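
As a rough illustration, a schema could look like the sketch below. The `Person` model and its fields are made up for this example, and passing it through a `schema` keyword argument is an assumption about the constructor, not something stated above.

```python
from typing import List

from pydantic import BaseModel, Field


class Person(BaseModel):
    """Hypothetical output schema; the field names are illustrative only."""

    name: str = Field(description="Full name of the person")
    skills: List[str] = Field(default_factory=list, description="Listed skills")


# Assumed usage, left commented out because it needs scrapegraphai,
# a valid graph_config and network access:
# script_creator_graph = ScriptCreatorMultiGraph(
#     prompt="Who is Marco Perini?",
#     source=["https://perinim.github.io/", "https://perinim.github.io/cv/"],
#     config=graph_config,
#     schema=Person,
# )

# The schema itself can be checked in isolation with placeholder data.
example = Person(name="Jane Doe", skills=["Python"])
```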

@PeriniM PeriniM merged commit 6f994ce into pre/beta Jun 11, 2024
3 checks passed
@PeriniM PeriniM deleted the dev branch June 11, 2024 23:54

🎉 This PR is included in version 1.7.0-beta.3 🎉

Your semantic-release bot 📦🚀
