COMPUTER VISION notes
COMPUTER VISION notes
COMPUTER VISION
1. What is Computer Vision?
Computer vision is a branch in the Domain of AI that enables computers to analyze meaningful information from
images, videos, and other visual inputs.
2. Applications of Computer Vision
Facial recognition
The most frequently used technology is smartphones. It is a technology to remember and verify a person,
object, etc from the visuals from the given pre-defined data. Such kinds of mechanics are often used for
security and safety purposes.
For eg: Face security lock-in devices and traffic cameras are some examples using facial recognition.
Facial filters
Modern days social media apps like Snapchat and Instagram use such kinds of technology that extract facial
landmarks and process them using AI to get the best result
Goggle lens
To search data, Google uses Computer vision for capturing and analyzing different features of the input
image to the database of images and then gives us the search.
Automotive
The machinery in industries is now using Computer vision. Automated cars are equipped with sensors and
software which can detect the 360 degrees of movements determine the location, detect objects and establish
the depth or dimensions of the virtual world.
For eg: Companies like Tesla are now interested in developing self-driving cars
Medical Imaging
For the last decades, computer vision medical imaging application has been a trustworthy help for
physicians and doctors. It creates and analyzes images and helps doctors with their interpretation.
The application is used to read and convert 2D scan images into interactive 3D models.
3. Computer Vision Tasks
The Application of the computer is performed by certain tasks on the data or input provided by the user
so it can process and analyze the situation and predict the outcome.
4. Basics of Images
The word “pixel” means a picture element.
Pixels
1
In genral term, The more pixels you have, the more closely the image resembles the
original.
Resolution
Computer systems only work in the form of ones and zeros or binary systems.
Each bit in a computer system can have either a zero or a one. Each pixel uses
1 byte of an image each bit can have two possible values which tells us that the
8 bit can have 255 possibilities of values that start from 0 and ends at 255.
6. Grayscale Images
Grayscale images are images which have a range of shades of gray without apparent
colour. The lightest shade is white total presence of colour or 255 and darkest colour is
black at 0.
Intermerdiate shades of gray have equal brightness levels of the three primary colours
RBG.
The computers store the images we see in the form of these numbers.
7. RGB IMAGES
RBG colurs
All the coloured images are made up of three primary colours Red, Green and Blue.
All the other colour are formed by using these primary colours at different proportions.
Computer stores RGB Images in three different channels called the R channel, G
channel and the B channel.
Image Features