DeepSpeech is a speech-to-text (STT), or automatic speech recognition (ASR), engine developed by Mozilla. It recognizes speech and converts spoken words into text. DeepSpeech is an...
imagededup is a Python package for finding exact and near-duplicate images in a collection of images. It can be useful to find and remove duplicate images...
Neural style transfer (NST) is a process that applies the style of one image to another image. NST uses two images: a content image and a style image. Images are...
The binary number system is a numeral system with two as its base. It uses only two digits: 0 and 1. Programming languages provide various methods to convert decimal...
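As a small illustration of decimal-to-binary conversion, here is a minimal Python sketch that uses repeated division by two. The function name `decimal_to_binary` is hypothetical and chosen only for this example; Python's built-in `bin()` and `format(n, "b")` give the same digits.

```python
def decimal_to_binary(n: int) -> str:
    """Return the binary digits of a non-negative integer as a string."""
    if n < 0:
        raise ValueError("n must be non-negative")
    if n == 0:
        return "0"
    digits = []
    while n > 0:
        digits.append(str(n % 2))  # remainder is the next least significant bit
        n //= 2                    # integer-divide to move to the next bit
    # Bits were collected from least to most significant, so reverse them.
    return "".join(reversed(digits))


print(decimal_to_binary(10))  # same digits as format(10, "b")
```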
When working with a file system, we sometimes need to find all files in a directory and its subdirectories. Programming languages provide various methods to do that. A table includes links to posts...
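One possible way to do this in Python is with the standard `pathlib` module. The sketch below is only an illustration; the helper name `find_files` is an assumption made for this example.

```python
from pathlib import Path


def find_files(root):
    """Return all regular files under root, including subdirectories, sorted by path."""
    # rglob("*") walks the tree recursively; is_file() filters out directories.
    return sorted(p for p in Path(root).rglob("*") if p.is_file())


if __name__ == "__main__":
    # List every file under the current working directory.
    for path in find_files("."):
        print(path)
```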
Color quantization is the process of reducing the number of colors in an image while preserving its visual appearance. The objective is to reproduce...
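To make the idea concrete, here is a minimal sketch of uniform quantization, one of the simplest approaches: each 8-bit channel is mapped to the midpoint of one of a fixed number of equal-width buckets. This is an assumption chosen for illustration and not necessarily the method the post describes; the names `quantize_channel` and `quantize_pixel` are hypothetical.

```python
def quantize_channel(value: int, levels: int) -> int:
    """Map an 8-bit channel value (0-255) to the midpoint of its bucket."""
    bucket = value * levels // 256          # index of the equal-width bucket
    return bucket * 256 // levels + 256 // (2 * levels)


def quantize_pixel(pixel, levels: int = 4):
    """Quantize an (R, G, B) tuple; at most levels**3 distinct colors remain."""
    return tuple(quantize_channel(c, levels) for c in pixel)


print(quantize_pixel((200, 10, 255)))
```

With `levels=4`, every channel takes one of four values, so an image is reduced to at most 64 colors.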
When working with image processing, we sometimes need to count the number of unique colors in an image. Programming languages provide various methods to do that. A table includes links to...
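As a rough sketch of the idea in Python, the example below counts unique colors in an image represented as rows of (R, G, B) tuples. Loading pixels from a file with an image library is left out, and the function name `count_unique_colors` is hypothetical.

```python
def count_unique_colors(pixels):
    """Count distinct colors in an image given as rows of (R, G, B) tuples."""
    # A set keeps one copy of each color, so its size is the unique count.
    return len({tuple(p) for row in pixels for p in row})


# A 2x2 image in which red appears twice.
image = [
    [(255, 0, 0), (0, 255, 0)],
    [(255, 0, 0), (0, 0, 255)],
]
print(count_unique_colors(image))
```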
Netron is an open-source tool for visualizing neural network, deep learning, and machine learning models. Netron lets you analyze a model's structure and ensure it matches your expected design. It supports...
YOLO (You Only Look Once) is an object detection algorithm that can be used to detect, classify, and track objects in near real-time. The first research paper about YOLO was...
Tesseract is an open-source optical character recognition (OCR) engine that recognizes text in images. Tesseract supports more than 100 languages. Tesseract can be used with many programming...