So today, DeepMind dropped a new paper: an early warning system for novel AI risks. The research proposes a framework for evaluating general-purpose models against novel threats. There are some pretty big implications here.
RELATED VIDEOS:
- "We have no moat" – Google AI Documen…
- Tree of Thoughts
- Governance of Superintelligence by OpenAI

LINKS:
- Jack Clark Twitter: https://twitter.com/jackclarksf
- GPT-4 System Card: https://cdn.openai.com/papers/gpt-4-s…
- Tree of Thoughts: https://arxiv.org/pdf/2305.10601.pdf
- Governance of Superintelligence: https://openai.com/blog/governance-of…
- OpenAI Hide and Seek: https://openai.com/research/emergent-…
- Live View of 25 AI Agents in a Village: https://reverie.herokuapp.com/arXiv_D…
- LessWrong Post: https://www.lesswrong.com/posts/FdQzA…
- DeepMind Blog: https://www.deepmind.com/blog/an-earl…
- Model Evaluation for Extreme Risks: https://arxiv.org/abs/2305.15324
TIMELINE:
- [00:00] Intro
- [00:31] Recap
- [01:27] OpenAI Proposal
- [02:21] Google DeepMind Paper
- [04:46] Abrupt Emergence
- [07:48] Existing Benchmarks
- [09:08] “Frontier”
- [13:35] Alignment Eval
- [15:09] Dangerous Capabilities
- [19:12] Safety Workflow
- [21:25] Power Seeking AI
- [24:05] Unknown Capabilities
- [30:43] Hazards