Skip to content

Commit 1f72836

Browse files
second iteration
1 parent fa0caca commit 1f72836

12 files changed

+16
-14
lines changed

README.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -159,7 +159,8 @@ Here we can finally get on the main part: making your own, best of them all, man
159159
We need to load our data prepared previously with Dataset:
160160

161161
```python
162-
dataset = Dataset('/Wiki_raw_set/wiki_data.csv')
162+
DATASET_PATH = '/Wiki_raw_set/wiki_data.csv'
163+
dataset = Dataset(DATASET_PATH)
163164
```
164165

165166
### Make initial model
@@ -247,9 +248,10 @@ best_model = experiment.select(perplexity_criterion)
247248
```
248249

249250
### Alternatively: Use Recipes
250-
If you need a topic model now, you can use one ofthe code snippets we call recipes.
251+
If you need a topic model now, you can use one of the code snippets we call *recipes*.
251252
```python
252253
from topicnet.cooking_machine.recipes import BaselineRecipe
254+
253255
training_pipeline = BaselineRecipe()
254256
EXPERIMENT_PATH = '/home/user/experiment/'
255257

topicnet/demos/README.md

Lines changed: 12 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,27 +1,27 @@
1-
### Demo
1+
# Demo
22
This section provides demonstrations of how to use this library in NLP tasks.
33

4-
`1-RTL-Wiki-Preprocessing` -- notebook describing how to get a wikipedia dataset and write data in VW format.
4+
1. [RTL-Wiki-Preprocessing](RTL-Wiki-Preprocessing.ipynb) -- notebook describing how to get a Wikipedia dataset and write data in VW format.
55

6-
`2-RTL-Wiki-Building-Topic-Model` -- notebook with first steps to build topic model by consequently tuning it's hyperparameters
6+
2. [RTL-Wiki-Building-Topic-Mode](RTL-Wiki-Building-Topic-Model.ipynb) -- notebook with first steps to build topic model by consequently tuning its hyperparameters
77

8-
`3-Visualizing-Your-Model-Documents` -- notebook providing a fres outlook on unstructured document collection with the help of a topic model
8+
3. [Visualizing-Your-Model-Documents](Visualizing-Your-Model-Documents.ipynb) -- notebook providing a fresh outlook on unstructured document collection with the help of a topic model
99

10-
`4-20NG-Preprocessing` -- preparing data from a well-know 20 Newsgroups dataset
10+
4. [20NG-Preprocessing](20NG-Preprocessing.ipynb) -- preparing data from a well-know 20 Newsgroups dataset
1111

12-
`5-20NG-GenSim-vs-TopicNet` -- a comparisson between two topic models build by Gensim and TopicNet library. In the notebook we compare model topics by calculating their [UMass coherence measure](https://palmetto.demos.dice-research.org/) and using Jaccard measure to compare topic top-tokens diversity
12+
5. [20NG-GenSim-vs-TopicNet](20NG-GenSim-vs-TopicNet.ipynb) -- a comparison between two topic models build by Gensim and TopicNet library. In the notebook we compare model topics by calculating their [UMass coherence measure](https://palmetto.demos.dice-research.org/) and using Jaccard measure to compare topic top-tokens diversity
1313

14-
`6-Postnauka-Building-Topic-Model` -- an analog of the RTL-Wiki notebook performed on the corpus of Russian pop-science articles given by postnauka.ru
14+
6. [PostNauka-Building-Topic-Model](PostNauka-Building-Topic-Model.ipynb)-- an analog of the RTL-Wiki notebook performed on the corpus of Russian pop-science articles given by postnauka.ru
1515

16-
`7-Postnauka-Recipe` -- a demonstration of rapid-prototyping methods provided by the library
16+
7. [PostNauka-Recipe](PostNauka-Recipe) -- a demonstration of rapid-prototyping methods provided by the library
1717

18-
`8-Coherence-Maximization-Recipe` -- a recipe for hyperparameter search in regard to custom Coherence metric
18+
8. [Coherence-Maximization-Recipe](Coherence-Maximization-Recipe.ipynb) -- a recipe for hyperparameter search in regard to custom Coherence metric
1919

20-
`9-Topic-Prior-Regularizer-Tutorial` -- a demonstration of the approach to learning topics from the unbalanced corpus
20+
9. [Topic-Prior-Regularizer-Tutorial](Topic-Prior-Regularizer-Tutorial.ipynb) -- a demonstration of the approach to learning topics from the unbalanced corpus
2121

22-
`10-Making-Decorrelation-and-Topic-Selection-Friends` -- reproduction of a very complicated experiment on automatically learning optimal number of topics from the collection. Hurdle is - both needed regularizers when working together nullify token-topic matrix.
22+
10. [Making-Decorrelation-and-Topic-Selection-Friends](Making-Decorrelation-and-Topic-Selection-Friends.ipynb) -- reproduction of a very complicated experiment on automatically learning optimal number of topics from the collection. Hurdle is -- both needed regularizers when working together nullify token-topic matrix.
2323

2424
----
2525
P.S. All the guides are supposed to contain **working** examples of the library code.
26-
If you happen to find code that is no longer works please write about it in the library issues.
26+
If you happen to find code that is no longer works, please write about it in the library issues.
2727
We will try to resolve it as soon as possible and plan to include fixes in the nearest releases.

0 commit comments

Comments
 (0)