data

First version createDataset.ipynb done by Fiona

Using Python 3.11

data

PTB location: /metadata/data/corpora at machine songcpu0

extra parser data: /metadata/data/corpora/aser/datas

RL reward

reward1: design a new reward based on goal

reward2: 1. cfg rule -> generate data -> judge using llm judge, distillate cfg tree, to see the preplexity

RL model 1

collect all cfg rules appeared in corpus(only type of word, e.g. S, NP, VP, ...)
Use the collected cfg rules to generate sentence, reward based on the generation results

How to evaluate the amount of reward for a generation?

train each gold tree

ask the RL model to choice a corresponding next level nodes, compare with the ground truth

RL model 2

collect all cfg rules appeared in corpus(only actual word)
Use the collected cfg rules to generate sentence, reward based on the generation results

How to evaluate the amount of reward for a generation?

train each gold tree

ask the RL model to choice a corresponding next level word in cfg form, and ask LLM judge the perplexity ???

Todo

recreate Physics of LM 1

task	status
createDataset	Done
complete tasks.py	just created
complete cfg.py	to automate cfg summary for input corpus
complete eval.py	accept input model for test

Usage

py eval.py to perform evaluation on given corpus for it's physics on given model

Files

dir	Usage
data	source corpus for cfg rules generation
result	resulted cfg rules

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
results		results
.gitignore		.gitignore
README.md		README.md
cfg.py		cfg.py
createDataset.ipynb		createDataset.ipynb
eval.py		eval.py
example.sh		example.sh
model.py		model.py
requirements.txt		requirements.txt
tasks.py		tasks.py
训练模型.py		训练模型.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

data

RL reward

RL model 1

How to evaluate the amount of reward for a generation?

train each gold tree

RL model 2

How to evaluate the amount of reward for a generation?

train each gold tree

Todo

Usage

Files

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

1324fgg/PhysicsOfLM

Folders and files

Latest commit

History

Repository files navigation

data

RL reward

RL model 1

How to evaluate the amount of reward for a generation?

train each gold tree

RL model 2

How to evaluate the amount of reward for a generation?

train each gold tree

Todo

Usage

Files

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages