
Conversation


@ChenZiHong-Gavin ChenZiHong-Gavin commented Oct 27, 2025

This PR introduces several LLM API clients and inference backends, including:

  1. http_client
  2. ollama_client
  3. openai_client
  4. hf
  5. sglang
  6. tgi (WIP)
  7. trt (WIP)
  8. vllm (WIP)

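The backend names above suggest a dispatch from a configured backend string to a client class. A minimal sketch of that pattern, with hypothetical class and function names (the actual classes and module layout in this PR may differ):

```python
# Hypothetical sketch of mapping a configured backend name to a client
# class; all names here are illustrative, not the PR's actual API.
from dataclasses import dataclass


@dataclass
class LLMConfig:
    backend: str
    model: str
    base_url: str = ""
    api_key: str = ""


class OpenAIClient:
    def __init__(self, config: LLMConfig):
        self.config = config


class OllamaClient:
    def __init__(self, config: LLMConfig):
        self.config = config


class HuggingFaceClient:
    def __init__(self, config: LLMConfig):
        self.config = config


# Registry from a SYNTHESIZER_BACKEND / TRAINEE_BACKEND value to a client
# class (illustrative subset; http_api is assumed OpenAI-compatible here).
BACKENDS = {
    "openai_api": OpenAIClient,
    "http_api": OpenAIClient,
    "ollama_api": OllamaClient,
    "huggingface": HuggingFaceClient,
}


def create_client(config: LLMConfig):
    """Instantiate the client registered for config.backend."""
    try:
        cls = BACKENDS[config.backend]
    except KeyError:
        raise ValueError(f"Unsupported backend: {config.backend}")
    return cls(config)
```

A registry like this keeps the WIP backends (tgi, trt, vllm) out of the map until they are ready, so an unfinished backend fails fast with a clear error instead of half-working.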
The .env changes can be found in .env.example:

# Tokenizer
TOKENIZER_MODEL=

# LLM
# Supported backends: http_api, openai_api, ollama_api, ollama, huggingface, tgi, sglang, tensorrt

# http_api / openai_api
SYNTHESIZER_BACKEND=openai_api
SYNTHESIZER_MODEL=gpt-4o-mini
SYNTHESIZER_BASE_URL=
SYNTHESIZER_API_KEY=
TRAINEE_BACKEND=openai_api
TRAINEE_MODEL=gpt-4o-mini
TRAINEE_BASE_URL=
TRAINEE_API_KEY=

# # ollama_api
# SYNTHESIZER_BACKEND=ollama_api
# SYNTHESIZER_MODEL=gemma3
# SYNTHESIZER_BASE_URL=http://localhost:11434
#
# Note: TRAINEE with the ollama_api backend is not supported yet, as ollama_api does not return logprobs.

# # huggingface
# SYNTHESIZER_BACKEND=huggingface
# SYNTHESIZER_MODEL=Qwen/Qwen2.5-0.5B-Instruct
#
# TRAINEE_BACKEND=huggingface
# TRAINEE_MODEL=Qwen/Qwen2.5-0.5B-Instruct

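The env file above defines one variable group per role (SYNTHESIZER_* and TRAINEE_*). A minimal sketch of how such a group could be read into a config dict, assuming plain `os.getenv` lookups (the PR's actual loader may differ):

```python
# Hypothetical loader for the SYNTHESIZER_* / TRAINEE_* variables shown
# in .env.example; illustrative only, not the PR's actual code.
import os


def load_llm_config(prefix: str) -> dict:
    """Collect the backend settings for one role, e.g. "SYNTHESIZER"."""
    return {
        "backend": os.getenv(f"{prefix}_BACKEND", "openai_api"),
        "model": os.getenv(f"{prefix}_MODEL", ""),
        "base_url": os.getenv(f"{prefix}_BASE_URL", ""),
        "api_key": os.getenv(f"{prefix}_API_KEY", ""),
    }


# Example using the defaults from .env.example:
os.environ["SYNTHESIZER_BACKEND"] = "openai_api"
os.environ["SYNTHESIZER_MODEL"] = "gpt-4o-mini"
cfg = load_llm_config("SYNTHESIZER")
```

Prefixing by role keeps the synthesizer and trainee independent, so they can point at different backends (e.g. a hosted API for synthesis and a local huggingface model for the trainee).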
@ChenZiHong-Gavin ChenZiHong-Gavin marked this pull request as ready for review October 29, 2025 11:25
@ChenZiHong-Gavin ChenZiHong-Gavin merged commit 6e4a142 into main Oct 29, 2025
3 checks passed
@ChenZiHong-Gavin ChenZiHong-Gavin deleted the feature/inference-backend branch October 29, 2025 11:25
@tpoisonooo

  1. The corresponding README should be updated so that the supported features are visible.
  2. Another useful addition would be automatic load scaling, e.g. by calling the ray[serve] API.

https://github.com/SeedLLM/DataPolisher/pull/1/files#diff-5648623a11374bdc84a573cac0a89d4e93d162c80c8938c82780f76c96c4373c

@tpoisonooo

Something like this:
[image]

or like this:
[image]

Either can serve as a reference.
