[Proposal] Out-of-tree Document Loader #6731
yaohui-wyh
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Background
Document Loaders are currently all "in-tree," meaning that their implementation is included in the LangChain repository. This increases the size of the LangChain framework and potentially introduces security risks, particularly for poorly implemented Document Loaders that fetch and parse remote resources.
This also places a heavy burden on LangChain contributors and Data vendors to upgrade or fix the current Loader implementations. A significant portion of issues & pull requests are related to Document Loaders, as can be seen here: https://github.com/hwchase17/langchain/issues?q=document+loader
Proposal (Draft)
Similar to the Kubernetes CSI Volume Plugin, we could draft a specification for the "LangChain Document Loader" plugin and allow the development & release of Document Loaders outside of the core LangChain codebase/release.
This would be beneficial for the LangChain ecosystem and also increase the production stability.
Beta Was this translation helpful? Give feedback.
All reactions