Configuring localGPT for production #472
Replies: 4 comments
-
@PromtEngineer Need your input on this!
-
@PromtEngineer It's the same issue that my team and I have.
-
@AnandMoorthy @matheus-mondaini We will need to implement a queue in the API to handle multiple users; it should be relatively easy to implement. I will have a look. Getting back to this project soon.
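A minimal sketch of what that request queue could look like: a single worker thread owns the GPU and processes prompts one at a time, so concurrent Flask requests wait in line instead of all hitting the model at once. The `run_inference` stub and the `/api/prompt` route name are assumptions for illustration, not the actual localGPT API.

```python
import queue
import threading

from flask import Flask, jsonify, request

app = Flask(__name__)

# All requests funnel through this queue; a single worker thread
# serializes access to the GPU-backed model.
_task_queue = queue.Queue()


def run_inference(prompt):
    # Placeholder: swap in the real localGPT pipeline call here.
    return f"echo: {prompt}"


def _worker():
    while True:
        prompt, result_box, done = _task_queue.get()
        try:
            result_box["answer"] = run_inference(prompt)
        except Exception as exc:  # surface model errors to the caller
            result_box["error"] = str(exc)
        finally:
            done.set()
            _task_queue.task_done()


threading.Thread(target=_worker, daemon=True).start()


@app.route("/api/prompt", methods=["POST"])
def prompt_route():
    result_box, done = {}, threading.Event()
    _task_queue.put((request.json["prompt"], result_box, done))
    done.wait()  # block this request until the worker finishes its turn
    status = 500 if "error" in result_box else 200
    return jsonify(result_box), status
```

With this in place, ten simultaneous users simply see longer latency under load rather than a crash, since only one inference ever runs on the card at a time.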
-
Hello @PromtEngineer, do you have any news about this feature? Thanks ;)
-
Hi,
I am planning to configure the project for production, and I expect around 10 people to use it concurrently. My current setup is an RTX 4090 with 24 GB of memory. The Flask app works fine when a single user is using localGPT, but when multiple requests come in at the same time the app crashes.
I also see the GPU go to 100% whenever a request comes in. Is there any way, with the current configuration, to serve around 10 people concurrently? Other suggestions are welcome too :)
Thanks!
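One common way to stop the crashes on a single-GPU box, sketched under assumptions: run the Flask app under a production WSGI server with exactly one worker process (so the model is loaded into the 24 GB card only once) and multiple threads to hold the ~10 concurrent connections. The module path `run_localGPT_API:app` is an assumption about where the Flask `app` object lives in your checkout; adjust it to match.

```shell
# One worker process = one copy of the model on the GPU;
# 10 threads let ~10 users stay connected while requests
# are handled; a long timeout covers slow generations.
gunicorn --workers 1 --threads 10 --timeout 300 run_localGPT_API:app
```

Note that threads alone only keep connections alive; the GPU work itself still needs to be serialized (e.g. with the queue discussed above), otherwise simultaneous inference calls can still exhaust the 24 GB of memory.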