Install docx2txt on Raspberry Pi

The docx2txt is a command line tool that allows to convert MS Word (DOCX) files to plain text files while preserves some formatting. This tool requires the Perl interpreter.

This tutorial demonstrates how to install docx2txt on Raspberry Pi.

Use SSH to connect to Raspberry Pi. Run the following commands to update the package lists and install docx2txt:

sudo apt update
sudo apt install -y docx2txt

Now download DOCX file from the Internet for testing:

wget -O test.docx https://raw.githubusercontent.com/dbashford/textract/master/test/files/docx.docx

Execute the docx2txt command to convert DOCX file to plain text file:

docx2txt test.docx test.txt

We can view the content of a file:

cat test.txt
This is a test
Just so you know:
...........

It is possible to write results to standard output by specifying a dash (-) as the output file name:

docx2txt test.docx -

If the docx2txt is no need anymore, you can remove it with command:

sudo apt purge --autoremove -y docx2txt

Leave a Comment

Your email address will not be published. Required fields are marked *