-
Notifications
You must be signed in to change notification settings - Fork 241
Open
Description
I followed your step to train my own S2V for my corpus on my customized NER model, thill step 2 everything is fine,.
corpusMODELV05.spacy is made and also corpusMODELV05-1.s2v
but in step 3 I faced with this error
ℹ Using 1 input files
✔ Created output directory data/S2VVocabMODELV05
ℹ Creating vocabulary counts
cat data\S2vcorpusMODELV05\corpusMODELV05-1.s2v | data/glove.6B.200d.txt/vocab_count -min-count 5 -verbose 2 > data\S2VVocabMODELV05\vocab.txt
✘ Failed creating vocab counts
I am working on Win 10 machine and have used this version of the glove
Wikipedia 2014 + Gigaword 5 (6B tokens, 400K vocab, uncased, 50d, 100d, 200d, & 300d vectors, 822 MB download): glove.6B.zip
https://nlp.stanford.edu/projects/glove/
it seems the number of VOC in
glove.6B.200d.txt/vocab_count is not in line with something
can someone help me ?
many thanks in advance
saimmehmood
Metadata
Metadata
Assignees
Labels
No labels