Computer Vision (CV)

Sub-bidang AI yang memungkinkan mesin menginterpretasikan gambar dan video. Aplikasi: object detection, face recognition, OCR, medical imaging, autonomous driving.

CV modern: CNN (2012+), Vision Transformer (2020+), multimodal vision-language models. Dataset benchmark: ImageNet, COCO, ADE20K.

Also known as: CV, visi komputer
Print

Computer Vision

Definisi

Computer vision adalah sub-bidang AI yang bertujuan memberi mesin kemampuan “melihat” — menginterpretasikan gambar, video, dan visual lainnya.

Tugas

  • Image classification (what is in the image?)
  • Object detection (where is it?)
  • Semantic segmentation (per-pixel label)
  • Instance segmentation
  • Face recognition
  • Pose estimation
  • Optical flow
  • 3D reconstruction
  • Generative (text-to-image, image-to-image)

Connected to

Not yet written

The following pages are referenced but don't exist yet — they'd make good future additions.

  • /concepts/convolutional-neural-network

References

  1. Wikipedia

Type at least 2 characters to search.

Press to navigate, to open, esc to close.