@@ -122,41 +122,6 @@ git clone https://huggingface.co/datasets/gaia-benchmark/GAIA
122
122
python examples/run_gaia.py
123
123
```
124
124
125
- ### Deploying Qwen Models via vLLM
126
-
127
- #### Step 1: Launch the vLLM Inference Service
128
-
129
- ``` bash
130
- nohup bash -c ' CUDA_VISIBLE_DEVICES=0,1 python -m vllm.entrypoints.openai.api_server \
131
- --model /input0/Qwen3-32B \
132
- --served-model-name Qwen \
133
- --host 0.0.0.0 \
134
- --port 8000 \
135
- --max-num-seqs 16 \
136
- --enable-auto-tool-choice \
137
- --tool-call-parser hermes \
138
- --tensor_parallel_size 2' > vllm_qwen.log 2>&1 &
139
- ```
140
-
141
- Update ` .env ` :
142
-
143
- ``` bash
144
- QWEN_API_BASE=http://localhost:8000/v1
145
- QWEN_API_KEY=" abc"
146
- ```
147
-
148
- #### Step 2: Launch the Agent Service
149
-
150
- ``` bash
151
- python main.py
152
- ```
153
-
154
- Example command:
155
-
156
- ``` bash
157
- Use deep_researcher_agent to search the latest papers on the topic of ' AI Agent' and then summarize it.
158
- ```
159
-
160
125
## Experiments
161
126
162
127
We evaluated our agent on the GAIA validation set and achieved state-of-the-art performance on May 10th.
@@ -195,15 +160,57 @@ playwright install chromium --with-deps --no-shell
195
160
196
161
Function-calling is now supported natively by GPT-4.1 / Gemini 2.5 Pro. Claude-3.7-Sonnet is also recommended.
197
162
163
+ ### 4. Use vllm for local models
164
+ We provide huggingface as a shortcut to the local model. Also provide vllm as a way to start services so that parallel acceleration can be provided.
165
+
166
+ #### Step 1: Launch the vLLM Inference Service
167
+
168
+ ``` bash
169
+ nohup bash -c ' CUDA_VISIBLE_DEVICES=0,1 python -m vllm.entrypoints.openai.api_server \
170
+ --model /input0/Qwen3-32B \
171
+ --served-model-name Qwen \
172
+ --host 0.0.0.0 \
173
+ --port 8000 \
174
+ --max-num-seqs 16 \
175
+ --enable-auto-tool-choice \
176
+ --tool-call-parser hermes \
177
+ --tensor_parallel_size 2' > vllm_qwen.log 2>&1 &
178
+ ```
179
+
180
+ Update ` .env ` :
181
+
182
+ ``` bash
183
+ QWEN_API_BASE=http://localhost:8000/v1
184
+ QWEN_API_KEY=" abc"
185
+ ```
186
+
187
+ #### Step 2: Launch the Agent Service
188
+
189
+ ``` bash
190
+ python main.py
191
+ ```
192
+
193
+ Example command:
194
+
195
+ ``` bash
196
+ Use deep_researcher_agent to search the latest papers on the topic of ' AI Agent' and then summarize it.
197
+ ```
198
+
198
199
## Acknowledgement
199
200
200
- DeepResearchAgent is inspired by and improved upon:
201
+ DeepResearchAgent is primarily inspired by the architecture of smolagents. The following improvements have been made:
202
+ - The codebase of smolagents has been modularized for better structure and organization.
203
+ - The original synchronous framework has been refactored into an asynchronous one.
204
+ - The multi-agent setup process has been optimized to make it more user-friendly and efficient.
205
+
206
+ We would like to express our gratitude to the following open source projects, which have greatly contributed to the development of this work:
207
+ - [ smolagents] ( https://github.com/huggingface/smolagents ) - A lightweight agent framework.
208
+ - [ OpenManus] ( https://github.com/mannaandpoem/OpenManus ) - An asynchronous agent framework.
209
+ - [ browser-use] ( https://github.com/browser-use/browser-use ) - An AI-powered browser automation tool.
210
+ - [ crawl4ai] ( https://github.com/unclecode/crawl4ai ) - A web crawling library for AI applications.
211
+ - [ markitdown] ( https://github.com/microsoft/markitdown ) - A tool for converting files to Markdown format.
201
212
202
- * [ smolagents] ( https://github.com/huggingface/smolagents )
203
- * [ OpenManus] ( https://github.com/mannaandpoem/OpenManus )
204
- * [ browser-use] ( https://github.com/browser-use/browser-use )
205
- * [ crawl4ai] ( https://github.com/unclecode/crawl4ai )
206
- * [ markitdown] ( https://github.com/microsoft/markitdown )
213
+ We sincerely appreciate the efforts of all contributors and maintainers of these projects for their commitment to advancing AI technologies and making them available to the wider community.
207
214
208
215
## Contribution
209
216
0 commit comments