Simple example for an agentic flow (Current Bengaluru talk) #287
base: master
Conversation
Found a typo in one of the SQL scripts.
simple-agentic-flow-flink-sql/1-create-connections-with-confluent-cli.md
## Pinecone

```bash
confluent flink connection create pinecone-connection --environment your-confluent-environment-name \
```
Suggested change:

```diff
-confluent flink connection create pinecone-connection --environment your-confluent-environment-name \
+confluent flink connection create pinecone-connection --environment your-confluent-environment-id \
```
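For context, a fuller version of that command might look like the sketch below; the endpoint and API key are placeholders, and the exact flag set should be verified against `confluent flink connection create --help`:

```bash
confluent flink connection create pinecone-connection \
  --cloud aws \
  --region us-east-1 \
  --type pinecone \
  --endpoint https://your-index-host.svc.pinecone.io \
  --api-key YOUR_PINECONE_API_KEY \
  --environment your-confluent-environment-id
```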
## Populating Pinecone with Vector Data

Example values used in this demo can be found in `documentation-sample.json`.
Add instructions / a link for getting started with Pinecone (link to https://app.pinecone.io/, copy your endpoint and API key, create an index with an embedding that matches what we use in OpenAI).

Add instructions so users can go from this JSON sample to embeddings -- this is where I got stuck.
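A minimal sketch of such a script, assuming the `openai` and `pinecone` Python packages, an existing index sized for `text-embedding-3-small` (1536 dimensions), and that `documentation-sample.json` holds a list of objects with `id` and `text` fields; the field names and index name here are hypothetical:

```python
import json

from openai import OpenAI
from pinecone import Pinecone

openai_client = OpenAI(api_key="YOUR_OPENAI_API_KEY")  # placeholder
pc = Pinecone(api_key="YOUR_PINECONE_API_KEY")         # placeholder
index = pc.Index("simple-agentic-rag")                 # hypothetical index name

# Assumes documentation-sample.json is a list of {"id": ..., "text": ...}
# objects; adjust the field names to the actual file structure.
with open("documentation-sample.json") as f:
    docs = json.load(f)

for doc in docs:
    # Embed with the same model the Flink SQL side uses, so query-time
    # vectors and stored vectors live in the same space.
    embedding = openai_client.embeddings.create(
        model="text-embedding-3-small",
        input=doc["text"],
    ).data[0].embedding

    index.upsert(vectors=[{
        "id": doc["id"],
        "values": embedding,
        "metadata": {"text": doc["text"]},
    }])
```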
I linked to the article by Diptiman on this topic! I honestly didn't want to go into all the depth and nuances; there are many ways to load data into a vector store. The value of this repo is to have the set of SQL commands that I show in the slides, and Pinecone is outside of the scope. This was meant to be accompanying material for the Bengaluru talk, so people have some code outside of the slides.
## Setting Up Connections Using Confluent CLI

Once external sources like **Pinecone** and **OpenAI** are configured, use the **Confluent CLI** to establish secure connections. Refer to [`1-create-connections-with-confluent-cli.md`](1-create-connections-with-confluent-cli.md) for examples.
There are a lot of platform and data-seeding prereqs to get through, and IMO this doesn't provide enough guidance. I'd suggest a section for platform setup that walks through the signup links plus any specifics you have to handle for the demo to work (e.g., I assume the Pinecone vector type is critical?). It doesn't have to hand-hold with screenshots, but it should give someone what they need to set the table. I tried to get there on my own and gave up after not following how to populate Pinecone from the sample JSON.

- Confluent Cloud: signup link, plus maybe use the quickstart to create a Kafka cluster and compute pool:

  ```bash
  confluent flink quickstart \
    --name simple_agentic_rag \
    --max-cfu 10 \
    --region us-east-1 \
    --cloud aws
  ```

- Pinecone: signup link, copy endpoint and API key, and which embedding type to pick when you create an index (see the sketch after this list).
- OpenAI: signup link, billing, API key creation.
- Atlas: same deal.
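For the Pinecone bullet, a sketch of the index creation with the Python client; the index name and region are placeholders, and the key constraint is that the dimension matches the embedding model (1536 for `text-embedding-3-small`):

```python
from pinecone import Pinecone, ServerlessSpec

pc = Pinecone(api_key="YOUR_PINECONE_API_KEY")  # placeholder

# text-embedding-3-small produces 1536-dimensional vectors; the index
# dimension must match, or upserts and queries will fail.
pc.create_index(
    name="simple-agentic-rag",  # hypothetical index name
    dimension=1536,
    metric="cosine",
    spec=ServerlessSpec(cloud="aws", region="us-east-1"),
)
```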
Yes, my approach to this repo was deliberately not a full tutorial with step-by-step guidance, but a brief list of instructions together with all the code snippets used.
The talk is the main delivery, and this is an accompanying repo, so that people don't have to type code from the slides.
Or maybe this repo isn't a good place for it?
I see what you're aiming for... it's currently at a bit of a crossroads, because I expected to be able to recreate the demo fairly happily, and I don't think that's going to happen if someone tries to. My suggestion is either:

a) make it clear that this is really ideal for reading along with the talk, or for stealing bits of it for people looking to do similar things. It's not a folder where readers should expect end-to-end recreation (in under an hour). As I reviewed it, that's what I expected and attempted, and I think others who find this will do the same.

b) add enough instructions to make it an end-to-end journey. It doesn't have to be super hand-holdy with screenshots. E.g., the Pinecone signup would be a sentence like "Create a Pinecone account and create an index configured for OpenAI's text-embedding-3-small model". For JSON to Pinecone, add a Python script / snippet that a techie would be able to take and massage to work with their index by plugging in their API key, etc.

I like (b) for all GitHub examples, since I think devs will have that expectation unless you set expectations clearly by making changes along the lines of (a)... but then IMO (a) winds up being closer to a blog in terms of audience / expectations.
```bash
confluent flink connection create openai-connection-vector-embeddings \
  --environment your-confluent-environment-name \
```
Suggested change:

```diff
-  --environment your-confluent-environment-name \
+  --environment your-confluent-environment-id \
```
oh, good catch!
```sql
(
    conversation_id string NOT NULL,
    customer_id string NOT NULL,
    cusomer_message string NOT NULL,
```
Not going to flag them all, but there are 20 or so cases in this PR to change `cusomer` to `customer`.
Suggested change:

```diff
-    cusomer_message string NOT NULL,
+    customer_message string NOT NULL,
```
```sql
    conversation_id STRING NOT NULL,
    customer_id STRING NOT NULL,
    cusomer_message String NOT NULL,
    chatbot_response String
```
Total nit, but I suggest all-caps data types throughout this PR: `BIGINT` and `STRING` everywhere.
good point, replaced
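For reference, with both fixes applied (the typo and the upper-case types), the column list from the excerpts above would read something like this sketch; the table name is illustrative:

```sql
-- Table name is illustrative; the columns come from the excerpts above.
CREATE TABLE customer_message_with_response (
    conversation_id STRING NOT NULL,
    customer_id STRING NOT NULL,
    customer_message STRING NOT NULL,
    chatbot_response STRING
);
```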
```sql
SELECT * FROM customer_message WHERE customer_id = 'customer_3'

---------------------------------- CALL TO EMBEDDING API ------------------------------------------
        INSERT INTO customer_message_and_embedding
```
Is the indented `INSERT` throughout intentional? It looks kinda weird to me. Suggest lining up `INSERT` and `SELECT` throughout.
Suggested change:

```diff
-        INSERT INTO customer_message_and_embedding
+INSERT INTO customer_message_and_embedding
```
fixed :)
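For reference, the aligned form puts `INSERT` and `SELECT` at the same level. The sketch below assumes Confluent Cloud Flink's `ML_PREDICT` table function with a registered embedding model; the model name, column list, and output column `embedding` are assumptions, since the repo's actual SELECT body isn't shown in this excerpt:

```sql
-- CALL TO EMBEDDING API (sketch): INSERT and SELECT start in the same column.
-- 'openai_embedding_model' and the `embedding` output column are placeholders
-- that would come from the CREATE MODEL definition.
INSERT INTO customer_message_and_embedding
SELECT conversation_id, customer_id, customer_message, embedding
FROM customer_message,
     LATERAL TABLE(ML_PREDICT('openai_embedding_model', customer_message));
```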
```sql
DROP TABLE customer_message
```
If you use the Flink quickstart plugin, then you can just delete the environment and the Flink API key. Make it a little easier on people to only have to run a couple of commands.
Agree, removed that file
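A sketch of that teardown, assuming the quickstart created a dedicated environment; both IDs below are placeholders:

```bash
# Delete the quickstart-created environment (this removes the Kafka cluster
# and compute pool inside it), then delete the Flink API key.
confluent environment delete env-123456
confluent api-key delete ABCDEF123456
```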
LGTM