
Change the HFOnnx pipeline to use Hugging Face Optimum rather than onnxruntime directly #371

@nickchomey

Description


The Hugging Face documentation says that seq2seq models can now be exported to ONNX with the OnnxSeq2SeqConfigWithPast class.
https://huggingface.co/docs/transformers/v4.23.1/en/main_classes/onnx#onnx-configurations

This support was added in March with huggingface/transformers#14700.

Perhaps it is now mature enough to be incorporated into txtai? It would be great to be able to use ONNX versions of the various Hugging Face models for their improved inference performance.

Additionally, it appears to support ViT models, along with other enhancements made since then. Here is the commit history for that class: https://github.com/huggingface/transformers/commits/main/src/transformers/onnx/config.py
