-
Notifications
You must be signed in to change notification settings - Fork 1
Description
Hi,
I am working on a somewhat related topic which would intersect with this package and maybe association rules.
problem 1
I have a list of search terms and I would like to understand "how associated" is a consequent to an antecedent.
For example, if I write "halloween" what are "the most likely" ngrams , "the most uniquely related to" n-grams, etc...
I have data of the form: text, counts, categorical variable (for example COSTUME_OUTFIT if the text is mostly related to a costume) and I am building a table similar to this (here the antecedent is "halloween")
| # | query | consequent | category | support | confidence | lift | conviction |
|---|------------------------------------|---------------------------|-----------------------|----------------|-------------|--------------|-------------|
| 0 | halloween decorations | decorations | LIGHTED_OUTDOOR_DISPLAY | 6.122921e-06 | 0.073535 | 24.696175 | 1.076157 |
| 1 | halloween costumes for women adult | costumes for women adult | COSTUME_OUTFIT | 6.576034e-07 | 0.007898 | 11147.867556 | 1.007960 |
Do you think something similar could be done with this package?
Here it is relevant to note that I have (text, counts) pairs, not just an streams of text, the text is very short (search queries).
problem 2
I have and extended table with date (text, counts, categorical_var, date) and I would like to find out which associations mostly occur on different dates (holidays, seasons) etc...