Homework-3
To run the assignment, execute python3.8 HW3/main.py
The file "main.py" is divided into 3 sections:
0. DEVEL (Alpha testing)
1. FULL_DATASET
2. SMALL_DATASET
To execute the code in a section, set the corresponding flag to True.
Select featurizers by toggling the following flags in main.py:
1. ZEROGRAM
2. W_SYNCTACTIC_CHUNK_WORD_FEATURIZER
3. W_SYNTACTIC_CHUNK_TAG_FEATURIZER
4. W_WORD_TAG_FEATURIZER
5. W_1_GRAM_TAG_FEATURIZER
6. W_1_GRAM_SYNCTACTIC_CHUNK_FEATURIZER
7. W_2_GRAM_TAG_FEATURIZER
8. W_2_GRAM_FORWARD_TAG_FEATURIZER
9. W_2_GRAM_SYNCTACTIC_CHUNK_FEATURIZER
10. W_2_GRAM_FORWARD_SYNCTACTIC_CHUNK_FEATURIZER
11. W_3_GRAM_TAG_FEATURIZER
12. W_3_GRAM_FORWARD_TAG_FEATURIZER
13. W_3_GRAM_AROUND_TAG_FEATURIZER
14. W_3_GRAM_SYNCTACTIC_CHUNK_FEATURIZER
15. W_3_GRAM_FORWARD_SYNCTACTIC_CHUNK_FEATURIZER
16. W_3_GRAM_AROUND_SYNCTACTIC_CHUNK_FEATURIZER
17. W_IS_NUMBER_FEATURIZER
18. W_IS_CAPITALIZED_FEATURIZER
19. W_IS_SPLIT_FEATURIZER
You can also toggle groups of featurizers using the following flags:
- WORD_PROPERTIES: Toggles the word-property featurizers (#17, #18, #19 in the list above).
- ONEGRAM: Toggles all OneGram featurizers (#2, #3, #4, #5, #6 in the list above).
- BIGRAM: Toggles all BiGram featurizers (#7, #8, #9, #10 in the list above).
- TRIGRAM: Toggles all TriGram featurizers (#11, #12, #13, #14, #15, #16 in the list above).
- ALL_FEATURES: Toggles all features.
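One way to picture how the group flags relate to the individual featurizer flags is the sketch below. It is not taken from main.py: the helper enabled_featurizers, the GROUPS table, and the flags dictionary are hypothetical names introduced for illustration. ONEGRAM is assumed to cover items #2 to #6, because item #7 is a 2-gram featurizer. Only the flag names come from the lists above.

```python
# Hypothetical sketch; the real logic in HW3/main.py may differ.
# Featurizer flag names, in the order they are listed in this README.
FEATURIZERS = [
    "ZEROGRAM",
    "W_SYNCTACTIC_CHUNK_WORD_FEATURIZER",
    "W_SYNTACTIC_CHUNK_TAG_FEATURIZER",
    "W_WORD_TAG_FEATURIZER",
    "W_1_GRAM_TAG_FEATURIZER",
    "W_1_GRAM_SYNCTACTIC_CHUNK_FEATURIZER",
    "W_2_GRAM_TAG_FEATURIZER",
    "W_2_GRAM_FORWARD_TAG_FEATURIZER",
    "W_2_GRAM_SYNCTACTIC_CHUNK_FEATURIZER",
    "W_2_GRAM_FORWARD_SYNCTACTIC_CHUNK_FEATURIZER",
    "W_3_GRAM_TAG_FEATURIZER",
    "W_3_GRAM_FORWARD_TAG_FEATURIZER",
    "W_3_GRAM_AROUND_TAG_FEATURIZER",
    "W_3_GRAM_SYNCTACTIC_CHUNK_FEATURIZER",
    "W_3_GRAM_FORWARD_SYNCTACTIC_CHUNK_FEATURIZER",
    "W_3_GRAM_AROUND_SYNCTACTIC_CHUNK_FEATURIZER",
    "W_IS_NUMBER_FEATURIZER",
    "W_IS_CAPITALIZED_FEATURIZER",
    "W_IS_SPLIT_FEATURIZER",
]

# Group flag -> (first, last) position in FEATURIZERS, 1-based and inclusive.
# Assumption: ONEGRAM stops at #6, since #7 is a 2-gram featurizer.
GROUPS = {
    "ONEGRAM": (2, 6),
    "BIGRAM": (7, 10),
    "TRIGRAM": (11, 16),
    "WORD_PROPERTIES": (17, 19),
}


def enabled_featurizers(flags):
    """Return the featurizer names switched on by individual or group flags."""
    if flags.get("ALL_FEATURES", False):
        return list(FEATURIZERS)
    # Start with featurizers whose own flag is set.
    enabled = {name for name in FEATURIZERS if flags.get(name, False)}
    # Add every featurizer covered by an enabled group flag.
    for group, (first, last) in GROUPS.items():
        if flags.get(group, False):
            enabled.update(FEATURIZERS[first - 1:last])
    # Preserve the README ordering in the result.
    return [name for name in FEATURIZERS if name in enabled]


if __name__ == "__main__":
    for name in enabled_featurizers({"ZEROGRAM": True, "BIGRAM": True}):
        print(name)
```

In this sketch an individual flag and a group flag can be combined, and ALL_FEATURES overrides everything else. Whether main.py resolves overlapping flags the same way is not stated here.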
Other Flags:
- EVAL_TRAIN: Controls whether the train dataset is evaluated.
- EVAL_TRAIN_SMALL: Controls whether the small train dataset is evaluated.
- FULL_DATASET: Controls whether the full dataset is used.
- SMALL_DATASET: Controls whether the small dataset is used.
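The flag-driven layout described above can be sketched roughly as follows. This is an illustration only and not the contents of main.py: the function planned_steps and the step descriptions are hypothetical, and each flag is assumed to act independently of the others. The flag names are the ones listed in this README.

```python
# Hypothetical sketch of flag-gated sections; the real HW3/main.py may differ.
# Example settings: run the small dataset and evaluate on its train split.
FLAGS = {
    "DEVEL": False,
    "FULL_DATASET": False,
    "SMALL_DATASET": True,
    "EVAL_TRAIN": False,
    "EVAL_TRAIN_SMALL": True,
}

# Assumed meaning of each flag, paraphrased from the README.
STEP_DESCRIPTIONS = {
    "DEVEL": "run the DEVEL (alpha testing) section",
    "FULL_DATASET": "run on the full dataset",
    "SMALL_DATASET": "run on the small dataset",
    "EVAL_TRAIN": "evaluate on the train dataset",
    "EVAL_TRAIN_SMALL": "evaluate on the small train dataset",
}


def planned_steps(flags):
    """List the steps that would run for the given flag settings."""
    return [
        description
        for name, description in STEP_DESCRIPTIONS.items()
        if flags.get(name, False)
    ]


if __name__ == "__main__":
    for step in planned_steps(FLAGS):
        print(step)
```

Printing the plan before a long run is a cheap way to confirm that the intended flags are set to True.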
Libraries required:
- numpy
- tqdm
- multiprocessing (part of the Python standard library; no separate installation needed)
- joblib
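Before starting a run, it may help to confirm that the required libraries are importable. The check below is a minimal sketch using the standard importlib module. The helper name missing_modules is hypothetical and not part of the assignment code.

```python
import importlib.util

# Libraries listed as required in this README.
REQUIRED = ["numpy", "tqdm", "multiprocessing", "joblib"]


def missing_modules(names):
    """Return the module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing libraries:", ", ".join(missing))
    else:
        print("All required libraries are available.")
```

Any name reported as missing would need to be installed into the environment used to run main.py.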
Homework-2
Libraries required:
- numpy
- tqdm
- multiprocessing (part of the Python standard library; no separate installation needed)
- joblib
HW2:
To run the assignment, execute nohup python3.8 HW2/main.py > nohup.out &
The file "main.py" is divided into 3 sections:
0. DEVEL (Alpha testing)
1. BIGRAM
   1.1 TRAIN
   1.2 DEV
   1.3 TEST
2. TRIGRAM
   2.1 TRAIN
   2.2 DEV
   2.3 TEST
To execute the code in a section, set the corresponding flag to True.
Homework-1
To run the assignment, execute nohup python3.8 HW1/main.py > nohup.out &
The file "main.py" is divided into 3 sections:
- TEST (Devo testing)
- DEVELOP (Attempting the questions)
- BRING_IT_TOGETHER (Attempting the bonus questions 4.1 and 4.2 along with executing on the test set)
To execute the code in a section, set the corresponding flag to True.