Edge count num doesnot equals the edge_df.count? #644
Replies: 1 comment
-
After debugging the write process, it was found that duplicate vertex primary keys (PK) occurred during the definition of the Vertex_DF, leading to duplicate PKs in the vertex indexing. This caused duplicate edges when the internal edge_df joined the edge table with the vertex mapping, resulting in an increased edge count. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I read three columns from orc files, and construct a edge_df use: orcfiles_df.select("src", "dst", "attr"), and use GraphWrite to write a graphdata , use putEdgeData(tag, edge_df) to write . but when write successfully, I read the write data with EdgeReader,the edge_count files read num is more bigger than the edge_df.count(). By the way , the vertex_df.count() equals the numbers in vertex_count file .
Is there any method that requires special attention when using it, which I might have overlooked?
Beta Was this translation helpful? Give feedback.
All reactions