-
|
Should we implement a supervised classifier, rule-based filters, or hybrid approach — and what datasets are needed? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
|
Classifier choice: supervised, rule-based, or hybrid
|
Beta Was this translation helpful? Give feedback.
Classifier choice: supervised, rule-based, or hybrid
Rule-based: simple, fast to implement (keywords, hashtags, author reputation). Works well for clear patterns.
Supervised ML classifier: learns complex patterns, better at nuance, but requires labeled dataset of tweets → “token-launch” vs “non-token” signals.
Hybrid (recommended): rules filter obvious non-signals first, then ML scores remaining tweets. This reduces noise and improves reliability.
Datasets needed: historical token launch tweets, labeled for success/failure; negative samples (noise) to reduce false positives; optionally augment with features like user reputation, linked contract addresses, timestamps.