AI Glossary/Computer Vision

What Is Computer Vision?

Definition

Computer vision is a field of artificial intelligence that enables machines to interpret, analyze, and understand visual information from digital images, videos, and other visual inputs, allowing computers to derive meaningful insights from visual data.

How Computer Vision Works

Computer vision teaches machines to see and understand the visual world the way humans do — but often faster, more consistently, and at greater scale. The field encompasses tasks like image classification (what is in this image?), object detection (where are objects in this image?), semantic segmentation (labeling every pixel), facial recognition, pose estimation, optical character recognition (OCR), and video analysis. Modern computer vision is primarily powered by deep learning, particularly convolutional neural networks (CNNs) and more recently vision transformers (ViTs). These models learn to extract visual features from raw pixel data through training on large labeled image datasets. The process typically involves feature extraction at multiple scales, from low-level edges and textures to high-level objects and scenes. Computer vision has enabled transformative applications across industries: autonomous driving, medical imaging diagnostics, manufacturing quality control, augmented reality, satellite imagery analysis, and content moderation. With the rise of multimodal AI models like GPT-4V and Gemini, computer vision is increasingly integrated with language understanding, enabling systems that can both see and reason about visual content.

Real-World Examples

1

A Tesla vehicle using computer vision to detect lane boundaries, traffic lights, and pedestrians from eight cameras in real time

2

A pathology lab using AI-powered computer vision to detect cancerous cells in tissue slide images with higher accuracy than manual review

3

Amazon Go stores using computer vision to track which products shoppers pick up and automatically charge their accounts without checkout

Recommended Tools

Related Terms