Install Tika Server Inside Docker Container in Linux

Install Tika Server Inside Docker Container in Linux

Tika Server is a web service that allows accessing Tika via a REST API interface. Tika is a toolkit for detecting and extracting metadata from different file types such as PDF, XLS, etc.

This tutorial explains how to install Tika Server inside a Docker container in the Linux. Commands have been tested on Ubuntu.

Prepare environment

Make sure you have installed Docker in your system. If you are using Ubuntu, installation instructions can be found in the post.

Install Tika Server

  • Host network

Run the following command to create a container for Tika Server that uses host network:

docker run -d --name=tika --restart=always --network=host \
    apache/tika
  • User-defined bridge network

User-defined bridge network can be used for listening on different port. By default, Tika service is listening on port 9998. It can be changed with -p option.

docker network create app-net
docker run -d --name=tika --restart=always --network=app-net \
    -p 8080:9998 \
    apache/tika

Testing Tika Server

Open a web browser and go to http://<IP_ADDRESS>:9998, where <IP_ADDRESS> is the IP address of the system. You will get a basic report of all the endpoints defined in the Tika Server.

Tika Server Inside Docker Container in Linux

Uninstall Tika Server

To completely remove Tika Server, remove its container:

docker rm --force tika

Remove Tika Server image:

docker rmi apache/tika

If a user-defined bridge network was created, you can delete it as follows:

docker network rm app-net

Leave a Comment

Cancel reply

Your email address will not be published.