AI Safety — Keamanan AI

Bidang riset yang mempelajari risiko AI (existential, alignment, misuse) dan bagaimana membangun AI yang aman dan bermanfaat. Termasuk technical safety, governance, dan ethics.

AI safety: alignment, robustness, interpretability, misuse prevention, value alignment, governance. Organisasi: Anthropic, OpenAI Safety, MIRI, ARC, GovAI, FLI.

Also known as: keamanan AI
Print

AI Safety

Definisi

AI safety adalah riset dan praktik untuk memastikan sistem AI aman, bermanfaat, dan terkendali — baik dalam jangka pendek (sistem sekarang) maupun jangka panjang (AGI, superintelligence).

Topik

  • Alignment — AI mengejar tujuan benar
  • Robustness — tahan terhadap adversarial input
  • Interpretability — memahami internal model
  • Misuse prevention — deepfake, senjata otonom, bioweapon
  • Governance — regulasi, standar
  • Existential risk — apakah superintelligent AI mengancam umat manusia

Pandangan Tokoh

  • Geoffrey Hinton — khawatir, mundur dari Google 2023
  • Yoshua Bengio — khawatir
  • Yann LeCun — skeptis, tidak percaya LLM bisa AGI
  • Sam Altman — percaya alignment bisa dipecahkan
  • Demis Hassabis — hati-hati, DeepMind fokus safety
  • Gary Marcus — skeptis LLM, butuh pendekatan baru

Connected to

Not yet written

The following pages are referenced but don't exist yet — they'd make good future additions.

  • /concepts/alignment

References

  1. Wikipedia

Type at least 2 characters to search.

Press to navigate, to open, esc to close.