Format errors are always reported when passing filter conditions to Tencent Cloud's vector database

In `.venv/Lib/site-packages/langchain_community/vectorstores/tencentvectordb.py` there is a function called `similarity_search_by_vector`. When it is called from outside, it raises an error no matter what format the filter is passed in.


```python
    def search(self, query: str, vectors: List[List[float]], limit: int = 5, filters: Optional[Dict] = None):
        """
        Search for similar vectors in LangChain.
        """
        # For each vector, perform a similarity search
        if filters:
            results = self.client.similarity_search_by_vector(embedding=vectors, k=limit, filter=filters)
        else:
            results = self.client.similarity_search_by_vector(embedding=vectors, k=limit)

        final_results = self._parse_output(results)
        return final_results
```

The code above is the source from the open-source framework mem0. Calling it produces the following error:

<img width="1293" height="822" alt="Image" src="https://github.com/user-attachments/assets/8a2058cb-d9fc-435e-ad1a-d2cb37e82a67" />



I then changed the mem0 source code to the following:


```python
    def search(self, query: str, vectors: List[List[float]], limit: int = 5, filters: Optional[Dict] = None):
        """
        Search for similar vectors in LangChain / TencentVectorDB.
        Compatible with TencentVectorDB filter grammar.
        """
        filter_expr = None
        if filters:
            if isinstance(filters, dict):
                # Convert to a format the LangChain/TencentVectorDB DSL can parse
                filter_parts = []
                for k, v in filters.items():
                    if v is None:
                        continue
                    # Detect the value type and quote strings
                    if isinstance(v, str):
                        v = v.replace('"', '\\"')  # escape double quotes
                        filter_parts.append(f'{k} == "{v}"')
                    else:
                        filter_parts.append(f'{k} == {v}')
                filter_expr = " and ".join(filter_parts)

            elif isinstance(filters, str):
                # Lenient conversion: single "=" to "==", single quotes to double quotes
                filter_expr = filters.replace(" = ", " == ").replace("'", '"')

        # (Optional) debug logging
        # print(f"[VectorSearch] filter_expr={filter_expr}")

        if filter_expr:
            results = self.client.similarity_search_by_vector(
                embedding=vectors, k=limit, filter=filter_expr
            )
        else:
            results = self.client.similarity_search_by_vector(
                embedding=vectors, k=limit
            )

        final_results = self._parse_output(results)
        return final_results
```

This still reports the following error; the concatenated string is rejected as well:

<img width="1244" height="977" alt="Image" src="https://github.com/user-attachments/assets/882abedc-0242-4b31-b79f-ea8ee90c37db" />

<img width="1103" height="577" alt="Image" src="https://github.com/user-attachments/assets/6367712b-27e9-453c-b917-b034fd22c523" />


1) The first error:
In the `similarity_search_by_vector` call of TencentVectorDB, the `filter` argument passed in was a dictionary (or some other type) rather than a string or None. This makes the Lark parser inside the underlying `translate_filter()` function raise `TypeError: text must be str or bytes`.
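
The type problem in the first error can be caught before the call reaches the parser. The following is only a minimal sketch: `ensure_filter_str` is a hypothetical helper name, not part of mem0 or LangChain, and it assumes that `filter` must be a string or None, as the traceback suggests.

```python
from typing import Any, Optional


def ensure_filter_str(filters: Any) -> Optional[str]:
    """Hypothetical guard: pass through str/None, reject anything else early."""
    if filters is None:
        return None
    if isinstance(filters, str):
        return filters
    # Fail here with a readable message instead of deep inside the parser.
    raise TypeError(
        f"filter must be str or None, got {type(filters).__name__}"
    )
```

With such a guard, passing a dict fails at the call site with a message that names the offending type, which makes it easier to see that a conversion step is missing.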


2) The second error:
The Lark parser used internally by the TencentVectorDB wrapper does not accept traditional SQL-style expressions such as `user_id='zz'`.
Judging from the source of `langchain_community.vectorstores.tencentvectordb` (open it and look at the definition of `translate_filter`), the filter syntax expected by the LangChain wrapper for Tencent's vector database appears to be a JSON-style or Python-style logical expression rather than SQL.


However, writing the expression as `user_id=="zz" and ...` is still rejected by Lark.

So no matter how you try to fix it, it always reports an error. Is there a bug in this area? Or was there something I didn't notice? How should I modify it?
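
One direction that may be worth testing, sketched under an assumption: if `translate_filter` parses the string with LangChain's query-constructor grammar, then the parser expects function-call syntax such as `eq("user_id", "zz")` or `and(eq(...), eq(...))` rather than infix `==` expressions. The helper name `dict_to_lc_filter` below is hypothetical, it only handles equality, and the resulting strings have not been verified against any particular `langchain_community` version.

```python
import json
from typing import Any, Dict, Optional


def dict_to_lc_filter(filters: Optional[Dict[str, Any]]) -> Optional[str]:
    """Hypothetical helper: {"user_id": "zz"} -> 'eq("user_id", "zz")'."""
    if not filters:
        return None
    parts = []
    for key, value in filters.items():
        if value is None:
            continue  # skip unset conditions, as the patched search() does
        # json.dumps double-quotes strings and leaves numbers bare.
        # Assumption: the grammar does not decode escape sequences, so
        # values that themselves contain quotes may still need care.
        rendered = json.dumps(value, ensure_ascii=False)
        parts.append(f"eq({json.dumps(key)}, {rendered})")
    if not parts:
        return None
    if len(parts) == 1:
        return parts[0]
    # Combine several conditions with the grammar's and(...) operator.
    return "and(" + ", ".join(parts) + ")"


print(dict_to_lc_filter({"user_id": "zz", "age": 3}))
```

The returned string would then be passed as `filter=...`. If the installed wrapper also exposes a separate parameter for native TencentVectorDB expressions (the source shown in the screenshots should confirm whether it does), passing the SQL-like string there instead of through `filter` is another option to try.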

The relevant source code in `.venv/Lib/site-packages/langchain_community/vectorstores/tencentvectordb.py` is as follows:



<img width="888" height="389" alt="Image" src="https://github.com/user-attachments/assets/8df7dad2-1914-4975-9661-8b0738c2e8a2" />


<img width="1072" height="801" alt="Image" src="https://github.com/user-attachments/assets/089c73c9-978a-4460-a0d2-c2a659c68e4f" />


<img width="1003" height="804" alt="Image" src="https://github.com/user-attachments/assets/5c4b3edb-791e-4b90-8055-b8d212dfeea8" />




















