AI Safety Gridworlds

92,009

3,917 14

Publicado 2018-05-25

Got an AI safety idea? Now you can test it out! A recent paper from DeepMind sets out some environments for evaluating the safety of AI systems, and the code is on GitHub.

The Computerphile video: • AI Gridworlds - Computerphile
The EXTRA BITS video, with more detail: • EXTRA BITS: AI Gridworlds - Computerp...

The paper: arxiv.org/pdf/1711.09883.pdf
The GitHub repos: github.com/deepmind/ai-safety-gridworlds

www.patreon.com/robertskmiles
With thanks to my wonderful Patreon supporters:

- Jason Hise
- Steef
- Cooper Lawton
- Jason Strack
- Chad Jones
- Stefan Skiles
- Jordan Medina
- Manuel Weichselbaum
- Scott Worley
- JJ Hepboin
- Alex Flint
- Justin Courtright
- James McCuen
- Richárd Nagyfi
- Ville Ahlgren
- Alec Johnson
- Simon Strandgaard
- Joshua Richardson
- Jonatan R
- Michael Greve
- The Guru Of Vision
- Fabrizio Pisani
- Alexander Hartvig Nielsen
- Volodymyr
- David Tjäder
- Paul Mason
- Ben Scanlon
- Julius Brash
- Mike Bird
- Tom O'Connor
- Gunnar Guðvarðarson
- Shevis Johnson
- Erik de Bruijn
- Robin Green
- Alexei Vasilkov
- Maksym Taran
- Laura Olds
- Jon Halliday
- Robert Werner
- Paul Hobbs
- Jeroen De Dauw
- Enrico Ros
- Tim Neilson
- Eric Scammell
- christopher dasenbrock
- Igor Keller
- William Hendley
- DGJono
- robertvanduursen
- Scott Stevens
- Michael Ore
- Dmitri Afanasjev
- Brian Sandberg
- Einar Ueland
- Marcel Ward
- Andrew Weir
- Taylor Smith
- Ben Archer
- Scott McCarthy
- Kabs Kabs
- Phil
- Tendayi Mawushe
- Gabriel Behm
- Anne Kohlbrenner
- Jake Fish
- Bjorn Nyblad
- Jussi Männistö
- Mr Fantastic
- Matanya Loewenthal
- Wr4thon
- Dave Tapley
- Archy de Berker
- Kevin
- Marc Pauly
- Joshua Pratt
- Andy Kobre
- Brian Gillespie
- Martin Wind
- Peggy Youell
- Poker Chen
- pmilian
- Kees
- Darko Sperac
- Paul Moffat
- Jelle Langen
- Lars Scholz
- Anders Öhrt
- Lupuleasa Ionuț
- Marco Tiraboschi
- Peter Kjeld Andersen
- Michael Kuhinica
- Fraser Cain
- Robin Scharf
- Oren Milman

Todos los comentarios (21)

@aretorta hace 6 años

I laughed way too hard at the "unplugging itself to plug in the vacuum cleaner" analogy.
@nova_vista hace 4 años

"it will volkswagen you" LOL
@duncanthaw6858 hace 6 años

I SACRIFICE ALL MY HP TO VACUUM THE LAST SPECK OF DUST IN THE HOUSE
@willdbeast1523 hace 6 años

There will be another video "if people want"? The people want.
@Njald hace 6 años

I Love the question at the end on "if we would like to see more". Of course we would. We're not here because we don't want to see more Robert Miles
@fermibubbles9375 hace 6 años

rob miles & isaac arthur collaboration is nerd heaven
@quangho8120 hace 4 años

Love the Tron music at the end btw
@alecjohnson55 hace 6 años

I love the ukelele cover of Daft Punk going on there. Are the outro songs played by you, Rob?
@Schwallex hace 6 años

OMFG you have a channel of your own and I only learn of it today. After many years of longing and begging for another tiny little breadcrumb from Brady I stumble upon a ten-storey cake with a watermelon on top. There goes my night. And my waistline.
@paulbottomley42 hace 3 años

I appreciate the green colour cast to this video that makes it seem like you're broadcasting from within The Matrix
@faerly hace 6 años

Great video as always, especially appreciated the tron legacy reference! Most people don't even seem to remember it exists so seeing your channel reference my favourite movie twice bad been good :)
@firefoxmetzger9063 hace 6 años

Regarding the exploration vs exploitation trade-off: I feel you are a bit imprecise with the terms at 5:10 ish. There is a massive difference between knowing that you will have N more trials or having infinite trials. If the number of trials (overall or remaining) is bounded then we can solve this optimally. It might not always be computationally feasible right now, but at least we know how to do it in theory. With infinite trials on the other hand there is no harm in trying a new thing each time as you always have infinitely many trials left to later on exploit your findings. In this case it is not clear how to optimally trade-off exploration vs exploitation.
@SJNaka101 hace 6 años

The Grid. A digital frontier. I tried to picture clusters of information as they moved through the computer. What did they look like? Ships? motorcycles? Were the circuits like freeways? I kept dreaming of a world I thought I'd never see. And then, one day... Edit: Hey rob, nobody else has done a cover of the grid on ukulele. Would love to have an mp3 of that! It sounds great
@user-wd4yj7ck6m hace 6 años

Thanks for posting these, very interesting as always!
@DrDress hace 3 años

3:38 "That's what the agent really is". That send chills down my spine for some reason.
@ONDANOTA hace 6 años

you should do a Ted talk
@kingxerocole4616 hace 6 años

What a coincidence, I was just reading the Gridworld paper this morning!
@Nurr0 hace 6 años

I've missed your videos!
@Varenon hace 3 años

So this video made me realize just how similar goals and restrictions set for A.I. are to things that trigger serotonin/oxytocin and disgust/pain respectively in organic life. The way the A.I. goes straight for reward functions over what you want them to do via setting said functions reminded me a lot of when Scientists wired up a button to a rats brain so that everytime they pressed it they'd orgasm, and they just pressed it all the time and stopped eating and drinking just pressing that button.. That helps put programming a lot more in perspective.. People do self destructive things all the time to trigger serotonin, so it's definitely important that if we are making something and can control what triggers their serotonin then we have to pay attention to what those things are..
@dangerousham3519 hace 6 años

Nice touch with the music, very on theme!