Skip to content

Feature request/discussion #76

@davidbp

Description

@davidbp

Hi,

I am working on a somewhat related topic which would intersect with this package and maybe association rules.

problem 1

I have a list of search terms and I would like to understand "how associated" is a consequent to an antecedent.
For example, if I write "halloween" what are "the most likely" ngrams , "the most uniquely related to" n-grams, etc...

I have data of the form: text, counts, categorical variable (for example COSTUME_OUTFIT if the text is mostly related to a costume) and I am building a table similar to this (here the antecedent is "halloween")

| # | query                              | consequent                | category              | support        | confidence | lift         | conviction |
|---|------------------------------------|---------------------------|-----------------------|----------------|-------------|--------------|-------------|
| 0 | halloween decorations              | decorations               | LIGHTED_OUTDOOR_DISPLAY | 6.122921e-06   | 0.073535    | 24.696175    | 1.076157    |
| 1 | halloween costumes for women adult | costumes for women adult  | COSTUME_OUTFIT        | 6.576034e-07   | 0.007898    | 11147.867556 | 1.007960    |

Do you think something similar could be done with this package?

Here it is relevant to note that I have (text, counts) pairs, not just an streams of text, the text is very short (search queries).

problem 2

I have and extended table with date (text, counts, categorical_var, date) and I would like to find out which associations mostly occur on different dates (holidays, seasons) etc...

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions