fix: Use singleton PostgresDBClient (Sqlalchemy engine) #321
Conversation
- promptfoo eval --max-concurrency 1 --config "/tmp/promptfooconfig.processed.yaml" --share --output "${OUTPUT_JSON_FILE}" --no-cache | tee "${EVAL_OUTPUT_FILE}"
+ promptfoo eval --config "/tmp/promptfooconfig.processed.yaml" --share --output "${OUTPUT_JSON_FILE}" --no-cache | tee "${EVAL_OUTPUT_FILE}"
  else
- promptfoo eval --max-concurrency 1 --config "/tmp/promptfooconfig.processed.yaml" --output "${OUTPUT_JSON_FILE}" --no-cache | tee "${EVAL_OUTPUT_FILE}"
+ promptfoo eval --config "/tmp/promptfooconfig.processed.yaml" --output "${OUTPUT_JSON_FILE}" --no-cache | tee "${EVAL_OUTPUT_FILE}"
Reverts #317, so it should default back to 4 threads
@cached_property
def db_client(self) -> db.PostgresDBClient:
    return db.PostgresDBClient()
This basically creates a singleton PostgresDBClient instance, which holds the SQLAlchemy engine.
“the engine is thread safe yes. individual Connection objects are not. we try to describe this at Working with Engines and Connections — SQLAlchemy 2.0 Documentation”
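For context, a minimal sketch of that pattern, assuming PostgresDBClient wraps the SQLAlchemy engine and exposes get_session() as in this codebase (the import path and method bodies here are illustrative, not the actual implementation):

```python
from functools import cached_property

from src.adapters import db  # assumed import path for this sketch


class AppConfig:
    @cached_property
    def db_client(self) -> db.PostgresDBClient:
        # cached_property evaluates this once per AppConfig instance, so the single
        # PostgresDBClient (and the SQLAlchemy engine/pool it holds) is reused.
        return db.PostgresDBClient()

    def db_session(self):
        # Each call returns a fresh session that checks a connection out of the shared
        # engine's pool; the engine is thread-safe, individual sessions/connections are not.
        return self.db_client.get_session()
```

Because cached_property memoizes on the instance, every db_session() call reuses the one engine and its connection pool instead of building a new engine per request.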
- return db.PostgresDBClient().get_session()
+ return self.db_client.get_session()
A new session uses an available connection (from the connection pool), so threads (e.g., those created from API calls) will not share a connection.
Our pool size is 20 – what happens when the maximum SQLAlchemy pool size is reached?
Google’s AI states:
- Requests are queued: any new requests for a database connection are placed in a queue, waiting for a connection to become available.
- Timeout begins: SQLAlchemy starts a timeout period (defaulting to 30 seconds, but configurable) to see if a connection is released back into the pool.
- Connection timeout error: if a connection doesn't become available within the timeout period, an exception (e.g., TimeoutError) is thrown, indicating a connection timeout.
From “SQLAlchemy connection pooling, what are checked out connections?”: “If all the connections are simultaneously checked out then you can expect an error (there will be a timeout period during which SQLAlchemy waits to see if a connection gets freed up; this is also configurable).”
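For reference, those limits are the create_engine pool knobs; a minimal, hypothetical configuration showing the behavior described above (the URL and values are placeholders, not this project's settings):

```python
from sqlalchemy import create_engine, exc

# Placeholder URL and numbers; this project's actual configuration may differ.
engine = create_engine(
    "postgresql+psycopg2://user:password@localhost:5432/app",
    pool_size=20,     # connections kept open in the pool
    max_overflow=10,  # extra connections allowed beyond pool_size (SQLAlchemy's default)
    pool_timeout=30,  # seconds a checkout waits for a free connection (SQLAlchemy's default)
)

try:
    # Each connect()/Session checks one connection out of the pool.
    with engine.connect() as conn:
        conn.exec_driver_sql("SELECT 1")
except exc.TimeoutError:
    # Raised when all pool_size + max_overflow connections are checked out and
    # none is returned within pool_timeout seconds.
    raise
```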
Promptfoo run with Gemini 2.5 pro: https://github.com/navapbc/labs-decision-support-tool/actions/runs/15737577700
Pull Request Overview
This PR centralizes database session management by introducing a singleton PostgresDBClient, updates tests to use the app_config fixture for obtaining test sessions, and reverts the Promptfoo GH action to its default concurrency.
- Introduce a cached db_client in AppConfig and route db_session through it
- Update pytest fixtures and test signatures to include app_config
- Remove explicit --max-concurrency flags in the Promptfoo workflow to restore default threading
Reviewed Changes
Copilot reviewed 9 out of 9 changed files in this pull request and generated 1 comment.
File | Description |
---|---|
app/src/app_config.py | Added db_client cached property, updated db_session to use it |
app/tests/conftest.py | Changed app_config fixture to depend on db_client |
app/tests/src/test_retrieve.py and other tests | Updated test signatures to include app_config and use chunk.id |
.github/workflows/promptfoo-googlesheet-evaluation.yml | Removed --max-concurrency flags to revert to default concurrency |
Comments suppressed due to low confidence (2)
app/src/app_config.py:45
- The cached_property decorator is used but not imported; add from functools import cached_property at the top of the file.
@cached_property
app/tests/conftest.py:111
- The db_client fixture is referenced here but not defined; consider adding a db_client pytest fixture or switching back to using the existing db_session fixture.
def app_config(monkeypatch, db_client: db.DBClient):
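If the fixture really is missing, one way an app_config test fixture could pick up a db_client fixture is sketched below (hypothetical wiring; the PR's actual conftest.py may resolve this differently). The trick relies on the fact that cached_property stores its result in the instance __dict__, so pre-seeding that entry makes config.db_client return the test client without ever constructing a production one.

```python
import pytest

from src.adapters import db          # assumed import path
from src.app_config import AppConfig  # assumed import path


@pytest.fixture
def app_config(monkeypatch, db_client: db.DBClient) -> AppConfig:
    # db_client is assumed to come from a separate test fixture that targets the test schema.
    config = AppConfig()
    # cached_property caches into the instance __dict__; seeding that entry makes
    # config.db_client return the test client instead of building a real one.
    monkeypatch.setitem(config.__dict__, "db_client", db_client)
    return config
```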
- assert results[0].chunk == short_chunk
+ assert results[0].chunk.id == short_chunk.id
What's the reason for these changes? Because the db_session from the test fixture (the second parameter to test_retrieve_with_scores) is different from the db_session generated by retrieve_with_scores?
Yes. There's some identifier in the chunk instances that is specific to the db_session, so those identifiers are different and cause the assertion to fail. Otherwise the chunks are identical.
Gotcha. I think the identifier here is the literal address in memory: under the hood, SQLAlchemy ensures that if you retrieve the same row back in the same session, it creates only a single instance of the object in memory, so you can do things like (pseudocode):
with some_session:
    # in this call SQLAlchemy creates a new instance of the Chunk class and returns that
    chunk_1 = some_session.select(chunk_id="abc").first()
    # in this call SQLAlchemy will recognize that it already has an instance of the Chunk class for this row, so it will return that Chunk
    chunk_2 = some_session.select(chunk_id="abc").first()
    assert chunk_1 is chunk_2  # these are literally the same object in memory
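The same idea with the real SQLAlchemy API (a sketch; Chunk, chunk_id, and session_factory are stand-ins for this codebase's model and session setup):

```python
from sqlalchemy import select

with session_factory() as session:
    # The first query materializes a Chunk instance and registers it in the
    # session's identity map.
    chunk_1 = session.execute(select(Chunk).where(Chunk.id == chunk_id)).scalar_one()
    # A second lookup for the same primary key is served from the identity map.
    chunk_2 = session.get(Chunk, chunk_id)
    assert chunk_1 is chunk_2  # literally the same object in memory

with session_factory() as other_session:
    # A different session has its own identity map, so it builds a separate instance.
    chunk_3 = other_session.get(Chunk, chunk_id)
    assert chunk_3 is not chunk_1
    assert chunk_3.id == chunk_1.id  # comparing primary keys still works across sessions
```

This is why the test now compares chunk.id instead of the chunk objects themselves: retrieve_with_scores runs in its own session, so its Chunk instances are never the same objects as the ones created through the test fixture's session.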
LGTM, just had a question to make sure I understood changes to the test correctly. TY!
Ticket
https://navalabs.atlassian.net/browse/DST-1025
Test changes for https://navalabs.atlassian.net/browse/DST-1042
Changes
Create a single PostgresDBClient instance, and create sessions from that instance.
Reverts #317, so Promptfoo GH action should default back to 4 threads.
Fixed tests that indirectly create a DB session so that they use the app_config test fixture (which accesses the test DB schema) rather than the non-test app_config (which accesses the real DB schema).
Testing
Tested against Gemini LLM: #321 (comment) and posted resulting DB connections: #321 (comment)