FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER.

TRANSCRIPT

  • LEX. Underpinning a lot of your writing is this sense that we’re screwed but it just feels like it’s an engineering problem I don’t understand why we’re screwed it it we time and time again Humanity has gotten itself into trouble and figured out a way to get out of the trouble.
  • ROMAN. We are in a situation where people making more capable systems just need more resources they don’t need to invent anything in my opinion some will disagree but so far at least I don’t see diminishing returns if you have 10x compute you’ll get better performance the same doesn’t apply to safety if you give uh Mei or any other organization 10 times the money they don’t output 10 times the safety and the Gap be between capabilities and safety becomes bigger and bigger all the time so it’s hard to be completely optimistic about our results here I can name 10 excellent breakthrough papers in machine learning I would struggle to name equally important breakthroughs in safety a lot of times a safety paper will propose a toy solution and point out 10 new problems discovered as a result it’s like this fractal you’re zooming in and you see more problems and it’s infinite in all directions does this apply to other Technologies or is this is this unique to AI where safety is always lagging behind so I guess we can look at related Technologies with cyber security right we we did manage to have Banks and casinos and Bitcoin so you can have secure narrow systems which are doing okay uh narrow attacks on them fail but you can always go outside outside of a box so if I can’t hack your Bitcoin I can hack you so there is always something if I really want it I will find a different way we talk about uh guard rails for AI well that’s a fence I can dig a tunnel under it I can jump over it I can climb it I can walk around it you may have a very nice guard rail but in a real world it’s not a permanent guarantee of safety and again this is the fundamental difference we are not saying we need to be 90% safe to get those trillions of dollars of benefit we need to be 100% indefinitely or we might lose the principle.

Learn More:

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER.