-
Notifications
You must be signed in to change notification settings - Fork 4
Open
Labels
Tech debtSomething to clean upSomething to clean upenhancementNew feature or requestNew feature or request
Description
The purpose of the following code was just to convert a Word document into HTML then into Coradoc.
We should supersede this as the gem is introducing unnecessary dependencies we don't want.
module Coradoc
module Input::Docx
def self.processor_id
:docx
end
def self.processor_match?(filename)
%w[.docx .doc].any? { |i| filename.downcase.end_with?(i) }
end
def self.processor_execute(input, options = {})
image_dir = Dir.mktmpdir
options = options.merge(sourcedir: image_dir)
doc = WordToMarkdown.new(input, image_dir)
doc = Coradoc::Input::HTML.cleaner.preprocess_word_html(doc.document.html)
options = WordToMarkdown::REVERSE_MARKDOWN_OPTIONS.merge(options)
Coradoc::Input::HTML.to_coradoc(doc, options)
ensure
FileUtils.rm_rf(image_dir)
end
def self.processor_postprocess(data, options)
Coradoc::Input::HTML.processor_postprocess(data, options)
end
# This processor prefers to work on original files.
def self.processor_wants_filenames; true; end
Coradoc::Input.define(self)
end
end
Metadata
Metadata
Assignees
Labels
Tech debtSomething to clean upSomething to clean upenhancementNew feature or requestNew feature or request