Tika Server is a web service that allows accessing Tika via a REST API interface. Tika is a toolkit for detecting and extracting metadata from different file types such as PDF, XLS, etc.
This tutorial explains how to install Tika Server inside a Docker container in the Linux. Commands have been tested on Ubuntu.
Prepare environment
Make sure you have installed Docker in your system. If you are using Ubuntu, installation instructions can be found in the post.
Install Tika Server
- Host network
Run the following command to create a container for Tika Server that uses host network:
docker run -d --name=tika --restart=always --network=host \
apache/tika
- User-defined bridge network
User-defined bridge network can be used for listening on different port. By default, Tika service is listening on port 9998. It can be changed with -p
option.
docker network create app-net
docker run -d --name=tika --restart=always --network=app-net \
-p 8080:9998 \
apache/tika
Testing Tika Server
Open a web browser and go to http://<IP_ADDRESS>:9998
, where <IP_ADDRESS>
is the IP address of the system. You will get a basic report of all the endpoints defined in the Tika Server.
Uninstall Tika Server
To completely remove Tika Server, remove its container:
docker rm --force tika
Remove Tika Server image:
docker rmi apache/tika
If a user-defined bridge network was created, you can delete it as follows:
docker network rm app-net
Leave a Comment
Cancel reply