I have a large collection of LibreOffice Writer files (.odt) that I would like to convert into Markdown files that look as much like the originals as possible in Obsidian.
I have tried the pandoc route:
pandoc --extract-media . -s "myfile.odt" -t markdown -o myfile.md
The problems are:
Images come out larger than in the original. This could be solved by resizing, but I don't think pandoc does that.
There does not seem to be an obvious way to choose link styling (I prefer [[link]] and ![[image]]).
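One possible workaround for the link styling is to post-process pandoc's output. Below is a minimal Python sketch that rewrites standard Markdown links and images into Obsidian wikilinks. The names `to_wikilinks` and `image_width` are made up for illustration. It assumes media file names are unique in the vault, and it does not handle URL-encoded paths or links inside code blocks. Obsidian's `![[image.png|300]]` syntax sets the display width in pixels, so this might also help with the oversized images.

```python
import re
from pathlib import PurePosixPath

# ![alt](path "optional title") optionally followed by a pandoc attribute block {...}
IMAGE_RE = re.compile(r'!\[[^\]]*\]\(([^)\s]+)(?:\s+"[^"]*")?\)(?:\{[^}]*\})?')
# [text](target) that is not an image (no "!" directly in front)
LINK_RE = re.compile(r'(?<!!)\[([^\]]+)\]\(([^)\s]+)\)')
# Anything starting with a URL scheme such as https: or mailto:
SCHEME_RE = re.compile(r'[a-zA-Z][a-zA-Z0-9+.-]*:')


def to_wikilinks(markdown, image_width=None):
    """Convert Markdown links/images to [[link]] and ![[image]] style."""

    def image(match):
        # Keep only the file name; Obsidian resolves it inside the vault.
        name = PurePosixPath(match.group(1)).name
        suffix = f"|{image_width}" if image_width else ""
        return f"![[{name}{suffix}]]"

    def link(match):
        text, target = match.group(1), match.group(2)
        if SCHEME_RE.match(target):
            return match.group(0)  # external URL: leave untouched
        note = PurePosixPath(target).stem  # drop folder and extension
        return f"[[{note}]]" if text == note else f"[[{note}|{text}]]"

    markdown = IMAGE_RE.sub(image, markdown)
    return LINK_RE.sub(link, markdown)


if __name__ == "__main__":
    sample = 'See [Notes](notes.md).\n\n![](media/image1.png){width="3in"}\n'
    print(to_wikilinks(sample, image_width=300))
```

External links are deliberately left in standard Markdown form, since wikilinks only point at notes and attachments inside the vault.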
I found a couple of related posts:
Things I have tried
I have searched the forum and looked into markdownload and pandoc and cannot get it to work.
What I’m trying to do
As a workaround, I am currently using Acrobat to export the PDFs to Word documents. Then I am opening them in LibreOffice Writer, selecting all and copying to Obsidian. This gives Obsidian a pretty decent quality text conversion with some extra lines where pictures were, but that does not affect preview and is beside the point. My next step is to copy the imag…
The post below has an answer that suggests converting to HTML and then importing the result with the official HTML importer plugin.
I’m looking to automate the process of converting .doc files from Google Drive to Markdown and then transferring them into a folder within my Obsidian vault. I have a substantial number of .doc files that I’d like to integrate into my Obsidian system for better note management.
Here’s my plan of action:
Download the .doc files from Google Drive.
Convert these .doc files to Markdown.
Copy the Markdown files into a specific folder within my Obsidian vault.
I’m curious if any other forum…
I hope to use this thread to collect ideas and observations, and to write a guide and perhaps a Python script to share with the Obsidian community.
Meanwhile, questions and comments are very welcome, especially if this has been solved already. I don't want to duplicate anyone's work.
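As a possible starting point for such a script, here is a rough sketch that runs the pandoc command from the top of this post over a whole folder and writes the results into a vault folder. The folder names and the helper names `pandoc_command` and `convert_all` are placeholders, and it assumes `pandoc` is on the PATH.

```python
import subprocess
from pathlib import Path


def pandoc_command(source, vault_folder):
    """Build the same pandoc call as above for one file."""
    source = Path(source)
    target = Path(vault_folder) / (source.stem + ".md")
    return [
        "pandoc",
        "--extract-media", str(vault_folder),  # images go next to the notes
        "-s", str(source),
        "-t", "markdown",
        "-o", str(target),
    ]


def convert_all(in_dir, vault_folder):
    """Convert every .odt file in in_dir; stop on the first pandoc error."""
    Path(vault_folder).mkdir(parents=True, exist_ok=True)
    for source in sorted(Path(in_dir).glob("*.odt")):
        subprocess.run(pandoc_command(source, vault_folder), check=True)


if __name__ == "__main__":
    # Hypothetical paths; adjust to your own folders.
    convert_all("odt_files", "MyVault/Imported")
```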
The first question to answer is: what is the best ODT-to-HTML conversion? LibreOffice can save as HTML, but the HTML looks quite dirty.
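For comparing converters, LibreOffice's own HTML export can be run from the command line without opening the GUI, using `soffice --headless --convert-to html`. A small Python sketch of batching that is below. It assumes the `soffice` binary is on the PATH (on some systems it is called `libreoffice` instead), and the function and folder names are only illustrative.

```python
import subprocess
from pathlib import Path


def soffice_html_command(source, out_dir, soffice="soffice"):
    """Build a headless LibreOffice call that exports one file to HTML."""
    return [
        soffice,
        "--headless",
        "--convert-to", "html",
        "--outdir", str(out_dir),
        str(source),
    ]


def export_folder_to_html(in_dir, out_dir):
    """Export every .odt file in in_dir to HTML in out_dir."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    for source in sorted(Path(in_dir).glob("*.odt")):
        subprocess.run(soffice_html_command(source, out_dir), check=True)


if __name__ == "__main__":
    # Hypothetical folder names.
    export_folder_to_html("odt_files", "html_out")
```

This only automates the export; it does not clean up the HTML, so the output would still need checking against other converters.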
I didn’t have good results with Pandoc – especially or partly because my docx files were messy.
So I used VBA macros in MS Word to convert text to suit the new environment (mainly bold text, some colours and of course the inlineshape/image references) and used another macro to save the images.
I wouldn’t recommend this route (and I certainly don’t remember the details or the (location of my) scripts) but at least this way I had some control over the outcome. I had no control with Pandoc.