Deduplicating / filtering false-positive tweets

  • Keep a short-term cache (Redis or in-memory) of processed tweet IDs, and skip any tweet whose ID is already in it.

  • Filter out retweets (retweeted_status), replies (in_reply_to_status_id), and quote tweets (is_quote_status).

  • For duplicated content (the same token launch tweeted by multiple accounts), keep only the first signal per token within a given timeframe.
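
The three points above could be combined roughly as follows. This is only a sketch under some assumptions: tweets arrive as dicts in the Twitter API v1.1 shape (`id_str`, `retweeted_status`, `in_reply_to_status_id`, `is_quote_status`), the token symbol has already been extracted by your own parser, and the names `TTLSet`, `is_original`, and `SignalFilter` are made up for illustration. It uses an in-memory cache; with Redis, a `SET` with the `NX` and `EX` options would play the same role.

```python
import time
from typing import Callable, Dict


class TTLSet:
    """In-memory set whose entries expire after `ttl` seconds."""

    def __init__(self, ttl: float, clock: Callable[[], float] = time.monotonic):
        self.ttl = ttl
        self.clock = clock
        self._expiry: Dict[str, float] = {}

    def add_if_new(self, key: str) -> bool:
        """Record `key` and return True if it was unseen or had expired."""
        now = self.clock()
        # Drop expired entries so the cache stays short-term.
        for stale in [k for k, t in self._expiry.items() if t <= now]:
            del self._expiry[stale]
        if key in self._expiry:
            return False
        self._expiry[key] = now + self.ttl
        return True


def is_original(tweet: dict) -> bool:
    """True if the tweet is not a retweet, a reply, or a quote tweet."""
    if "retweeted_status" in tweet:
        return False
    if tweet.get("in_reply_to_status_id") is not None:
        return False
    if tweet.get("is_quote_status"):
        return False
    return True


class SignalFilter:
    """Hypothetical helper: passes only the first original tweet per token."""

    def __init__(self, id_ttl: float = 3600.0, token_window: float = 900.0,
                 clock: Callable[[], float] = time.monotonic):
        self.seen_ids = TTLSet(id_ttl, clock)
        self.seen_tokens = TTLSet(token_window, clock)

    def accept(self, tweet: dict, token: str) -> bool:
        # 1. Skip tweet IDs that were already processed.
        if not self.seen_ids.add_if_new(tweet["id_str"]):
            return False
        # 2. Skip retweets, replies, and quote tweets.
        if not is_original(tweet):
            return False
        # 3. Keep only the first signal per token within the window.
        return self.seen_tokens.add_if_new(token.lower())
```

Call `accept(tweet, token)` for each incoming tweet and act only when it returns True. The TTL values are placeholders; tune them to how long duplicates keep showing up in your feed.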

Answer selected by machenxi