Others
Locally Linear Embedding
EigenFace
Viola-Jones object detection framework
The Viola-Jones object detection framework is the first object detection framework to provide competitive object detection rates in real-time proposed in 2001 by Paul Viola and Michael Jones.
The characteristics of Viola-Jones algorithm which make it a good detection algorithm are:
-
Robust - very high detection rate (true-positive rate) & very low false-positive rate always.
-
Real time - For practical applications at least 2 frames per second must be processed.
-
Face detection only (not recognition) - The goal is to distinguish faces from non-faces (detection is the first step in the recognition process).
The algorithm has four stages:
- Haar Feature Selection
- Creating an Integral Image
- Adaboost Training
- Cascading Classifiers
The features sought by the detection framework universally involve the sums of image pixels within rectangular areas. As such, they bear some resemblance to Haar basis functions, which have been used previously in the realm of image-based object detection. However, since the features used by Viola and Jones all rely on more than one rectangular area, they are generally more complex. The figure on the right illustrates the four different types of features used in the framework. The value of any given feature is the sum of the pixels within clear rectangles subtracted from the sum of the pixels within shaded rectangles. Rectangular features of this sort are primitive when compared to alternatives such as steerable filters. Although they are sensitive to vertical and horizontal features, their feedback is considerably coarser.
single-image super-resolution (SISR)
create a large-sized image from a low- resolution image
EnhanceNet-PAT does not attempt pixel-perfect reconstruction, but rather aims for faithful texture synthesis
By detecting and generating patterns in a low-resolution image and applying these patterns in the upsampling process, EnhanceNet-PAT adds extra pixels to the low-resolution image
https://webdav.tue.mpg.de/pixel/enhancenet
https://github.com/msmsajjadi/EnhanceNet-Code
Waifu2
waifu2x is an image scaling and noise reduction program for anime-style art and other types of photos.
waifu2x was inspired by Super-ResolutionConvolutional Neural Network(SRCNN).It uses NvidiaCUDA for computing, although alternative implementations that allow for OpenCL and Vulkan have been created.
https://github.com/nagadomi/waifu2x
https://en.wikipedia.org/wiki/Comparison_gallery_of_image_scaling_algorithms
https://towardsdatascience.com/deep-learning-based-super-resolution-with-opencv-4fd736678066
Rasterization
Rasterisation (or rasterization) is the task of taking an image described in a vector graphics format (shapes) and converting it into a raster image (pixels or dots) for output on a video display or printer, or for storage in a bitmap file format. It refers to both rasterisation of models and 2D rendering primitives such as polygons, line segments, etc.
https://en.wikipedia.org/wiki/Rasterisation
Dithering
Remove fuzziness in medical or other images (uses MST)
Video Multimethod Assessment Fusion (VMAF)
Video Multimethod Assessment Fusion (VMAF)is an objective full-reference video quality metric developed by Netflix in cooperation with the University of Southern California and the Laboratory for Image and Video Engineering (LIVE) at The University of Texas at Austin. It predicts subjective video quality based on a reference and distorted video sequence. The metric can be used to evaluate the quality of different video codecs, encoders, encoding settings, or transmission variants.
Datasets
- ImageNet
Vertex AI Matching Engine
Vertex AI Matching Engine provides the industry's leading high scale, low latency, vector-similarity matching (also known as approximate nearest neighbor) service, and industry-leading algorithms to train semantic embeddings for similarity-matching use cases.
https://cloud.google.com/vertex-ai/docs/matching-engine/overview
Liveness detection
https://towardsdatascience.com/implementing-liveness-detection-with-google-ml-kit-5e8c9f6dba45
Border Detection / Segementation
Comic book panel segmentation • Max Halford