chapter06 #22
Mandeepumra
started this conversation in
General
chapter record:
chapter06 directory
a narrative for the chapter
README.md file
environment and origins:
macOS
python version 3.8.0
docanalysis version 0.2.0
py4ami version 0.0.45
What we started with and our environment.
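To confirm that a machine matches the environment recorded above, a short check like the following may help. It is only a sketch: it assumes the pip distribution names are `docanalysis` and `py4ami`, and the helper names `installed_version` and `compare_versions` are made up for illustration.

```python
from importlib import metadata

# Versions recorded in this chapter's environment notes.
EXPECTED = {"docanalysis": "0.2.0", "py4ami": "0.0.45"}


def installed_version(dist_name):
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def compare_versions(expected):
    """Map each package name to an (expected, installed) pair."""
    return {name: (want, installed_version(name)) for name, want in expected.items()}


if __name__ == "__main__":
    for name, (want, have) in compare_versions(EXPECTED).items():
        status = "ok" if want == have else "differs"
        print(f"{name}: expected {want}, installed {have} ({status})")
```

A package that is not installed is reported as `None` rather than raising an error, so the check can run before `pip install docanalysis`.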
software used:
py4ami
docanalysis
How we set up the software:
pip install docanalysis
docanalysis --help
creation of raw HTML:
converted PDF to HTML:
python -m py4ami.ami_pdf --inpath fulltext.pdf --outdir / --maxpage 110
tools used, comments on result.
py4ami
docanalysis
creation of sections
extraction of dictionaries
manual
abbreviation
method:
mkdir wiki_hackathon
cd wiki_hackathon
mkdir Chapter06
cd Chapter06
mkdir sections
cd sections
mkdir 0_main_body
docanalysis --project_name wiki_hackathon --output dict_search_5.csv --make_json dict_search_5.json --make_ami_dict entities --extract_abb ip_3_6_energy_abb
where,
--project_name - the name of the project (here, wiki_hackathon)
--output - a CSV file for the dictionary search (not used here, but it must be created)
--make_json - a JSON output file (here, dict_search_5.json); not currently used, but required
--make_ami_dict - uses the entities created in the above command
--extract_abb - the abbreviation dictionary that is the output
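The method above can also be scripted. The sketch below creates the same directory layout and assembles the same docanalysis command as an argument list, using only the flags and values listed above. The function names `make_layout` and `build_command` are hypothetical, and the script only prints the command rather than running it.

```python
import shlex
from pathlib import Path


def make_layout(root):
    """Create wiki_hackathon/Chapter06/sections/0_main_body under root."""
    target = Path(root) / "wiki_hackathon" / "Chapter06" / "sections" / "0_main_body"
    target.mkdir(parents=True, exist_ok=True)
    return target


def build_command():
    """Return the docanalysis call from the method above as an argument list."""
    return [
        "docanalysis",
        "--project_name", "wiki_hackathon",
        "--output", "dict_search_5.csv",       # required, not used here
        "--make_json", "dict_search_5.json",   # required, not used here
        "--make_ami_dict", "entities",
        "--extract_abb", "ip_3_6_energy_abb",  # the abbreviation dictionary
    ]


if __name__ == "__main__":
    # Print the command for inspection; pass the list to subprocess.run to execute it.
    print(shlex.join(build_command()))
```

Keeping the command as a list avoids shell quoting problems if a project name or file name ever contains spaces.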
keyword/phrase: keywords are extracted with the help of gensim
introduction.md
table of contents.md
FAQs.md
annotation of HTML using dictionaries: