We could load data from: - epub: should be easily doable, it’s HTML in a ZIP. What would be nice would be to allow import of all the HTML pages in one article, each one in an independent article, each one in a group of related articles. The only issue is to load the ZIP while being protected from ZIP bombs. - odt/docx: it’s XML in a ZIP. I don’t think converting to HTML can be part of this project. We could have a service based on LibreOffice that does the conversion to HTML for us. - pdf: probably the hardest one.
This issue appears to be discussing a feature request or bug report related to the repository. Based on the content, it seems to be still under discussion. The issue was opened by Jenselme and has received 2 comments.