Chelsea Manning Trial Transcript Index Generator

This project generates an index for the Chelsea Manning Trial Transcript. The canonical version of the generated index can be found on the Internet Archive here.

How to Use

Download this repository into a directory of your choice. We shall refer to this directory as $DIR for the rest of this explanation.
Create a directort $DIR/dictionaries, and put the following dictionaries into them:
- Oxford Dictionary of Computer Science; ebook; 2016 ISBN 978-0-19-100288-5 (can be found in various places, both legally and illegally).
- DOD Dictionary of Military and Associated Terms; pdf; May 2019; https://www.jcs.mil/Portals/36/Documents/Doctrine/pubs/dictionary.pdf?ver=2019-05-29-162249-290
- The Jargon File Glossary; webpage; http://catb.org/jargon/html/go01.html
- NIST Glossary of Key Information Security Terms; pdf; May 2013; https://csrc.nist.gov/publications/detail/nistir/7298/rev-2/final
Convert the PDFs to TXTs in the same directory, using Adobe Acrobat Pro using the following methods. If you don't have Acrobat Pro, this process can't be recreated with any guarantee, sorry.
- DOD Dictionary of Military and Associated Terms: Save as Accessible Text
- NIST Glossary of Key Information Security Terms: Save as Plain Text (HTML/ASCII Encoding)
Connect to the internet.
From within $DIR the extractor script: python3 src/run.py. Follow the prompts. To recreate the content on archive.org, pick your base url to be https://archive.org/download/

The output of this whole process can be found in $DIR/output, and will have the following sort of structure:

output
├── usvmanning-index
│   ├── a_certificate.html
│   ├── a_life.html
│   ├── acceptability.html
│   ...
│   ├── term_list.html
│   ...
├── usvmanning1
│   ├── page_0000.html
│   ├── page_0001.html
│   ...
├── usvmanning10
├── usvmanning100
├── usvmanning101
...

The various directories named usvmanningNNN are the browsable HTML versions of the PDFs on archive.org, while usvmanning-index contains the generated index pages. The most important one being term_list.html, which contains the list of terms that are in the index, and links to the individual pages for those terms, which then link to the HTML versions of the PDFs.

Contact

If you have any questions or need assistance with any of this,

This index and the software for it was created by volunteers at Queerious Labs in San Francisco. All inquiries about the project should be directed to Beka Valentine, who can be contacted via email at beka@queeriouslabs.com or via Twitter at @beka_valentine.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Chelsea Manning Trial Transcript Index Generator

How to Use

Contact

About

Uh oh!

Releases

Packages

Languages

queeriouslabs/chelsea-manning-transcript-index

Folders and files

Latest commit

History

Repository files navigation

Chelsea Manning Trial Transcript Index Generator

How to Use

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages