
First, do no harm.
1,500+ Posts…
Free knowledge sharing for Safe AI. Not for profit. Link-outs to sources provided. Ads are likely to appear on link-outs (zero benefit to this journal publisher).
BBC HARDtalk. Professor of Computer Science at University of California, Berkeley – Stuart Russell. 14 OCT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. What is the most serious existential threat facing humanity? Artificial Intelligence, warned the physicist Stephen [...]
OpenAI Alignment Team Lead: Jan Leike's P(doom) is, like, 10-90%.
OpenAI Alignment Team Lead @JanLeike's P(doom) is 10-90%. pic.twitter.com/CFuyQxQ0hf Liron Shapira (@liron) August [...]
10–25% PROBABILITY AI IS THE END OF HUMANITY. Dario Amodei’s P(doom) is 10–25%. CEO and Co-Founder of Anthropic.
AI Safety Weekly. AI Tracker #6: Russian Roulette, Anyone?
A lot of high-quality AI safety conversation happens on Twitter. Unfortunately, this leads to great write-ups & content being buried in days. We've set up AI Tracker to [...]
POLICY BRIEF. How to Govern AI in an Age of Global Tension. Boston Global Forum Special Report, The Rīga Conference 2023.
12 October 2023. Global tensions in the 21st century have undergone a notable transformation. While not [...]
Existential Risk Observatory. Does interpretability change everything? 07 OCT 2023.
Significant news: two breakthroughs in interpretability, a subfield of AI Alignment, came out this week. What is AI Alignment again? AI Alignment [...]
BBC NEWS. How a chatbot encouraged a man who wanted to kill the Queen. 06 OCT 2023.
By Tom Singleton, Tom Gerken & Liv McMahon, technology reporters, BBC News. The case of [...]
REPORT. Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! 05 OCT.
A very good read from a respected source! “When companies allow for fine-tuning and the creation of customized versions of the technology, they open a Pandora’s box of [...]
THE WASHINGTON POST. ChatGPT provided better customer service than his staff. He fired them. Artificial intelligence is rapidly changing the world of customer service and call centers. Developing economies worry they’ll face the brunt. October 3, 2023.
REPORT. Low-Resource Languages Jailbreak GPT-4. 03 OCT 2023.
Abstract: AI safety training and red-teaming of large language models (LLMs) are measures to mitigate the generation of unsafe content. Our work exposes the inherent cross-lingual vulnerability of [...]
Piers Morgan vs Yuval Noah Harari On AI | The Full Interview | 2 Oct 2023
Well, joining me now is historian and philosopher Yuval Noah Harari. Yuval, great to see you here in the Piers Morgan Uncensored studio. Last time I [...]
My views on “doom”. Paul Christiano. AI Alignment. 3 min read. 27 APRIL
I’m often asked: “what’s the probability of a really bad outcome from AI?” There are [...]
Future of Life Institute. REGULATE AI NOW.
"Lights out for all of us." [Translation: Death for all humanity.] TRANSCRIPT. In March 2023 an open letter sounded the alarm on the training of giant [...]
Future of Life Institute Newsletter: Our Pause Letter, Six Months Later. October 01, 2023
TRANSCRIPT. In March 2023 an open letter sounded the alarm on the training of giant AI experiments. It was signed by over 30,000 individuals including more than 2,000 industry leaders and more than [...]
“If the humans cannot unite then the alien intelligence will definitely win because […] then it’s a race to the bottom.” Yuval Noah Harari Audience Q&A @ CogX Festival 2023.
"Our superpower has always been [...]
Stuart Russell. Is AI an Existential Threat to Humanity? Southbank Centre, London. 28 SEPT 2023.
[...] collaboration. So thank you all so much for joining us. I want to start by asking you to put your stake in [...]
REPORT. Navigating the Jagged Technological Frontier: Field Experimental Evidence of the Effects of AI on Knowledge Worker Productivity and Quality
Harvard Business School Technology & Operations Mgt. Unit Working Paper No. 24-013. 58 [...]
Meta. Introducing New AI Experiences Across Our Family of Apps and Devices. September 27, 2023.
Meta Connect 2023: Everything Revealed in 10 Minutes. Takeaways: We’re starting to [...]
Extraordinary claims require extraordinary evidence. – Carl Sagan. No evidence here. None whatsoever… “Don’t Fear the Terminator”
A worrying read from an expert thought leader! (2019) Note: Unfortunately, four years on, no evidence has yet been provided to support some of these extraordinary claims. The absence of scientific evidence, [...]
ANTHROPIC. Expanding access to safer AI with Amazon. Sep 25, 2023.
Today, we’re announcing that Amazon will invest up to $4 billion in Anthropic. The agreement is part of a broader collaboration [...]
VOX. The $1 billion gamble to ensure AI doesn’t destroy humanity. The founders of Anthropic quit OpenAI to make a safe AI company. It’s easier said than done. 25 SEPT.
By Dylan Matthews (dylan@vox.com). Updated [...]
WARNING. EXISTENTIAL RISK. The Next Pandemic Will Certainly Come Someday. (and COVID is not over)
Natural or man-made... the NEXT PANDEMIC is CERTAINLY coming. THE MAN-MADE RISK: Currently unregulated, easily available and uncontained AI technology, combined with powerful advances in biotechnology, creates an existential threat to billions of people from potential bio-terrorist actors' creation of synthetic pathogens, poisons and other unpredictable [...]
THE WASHINGTON POST. A flood of new AI products just arrived — whether we’re ready or not. 22 SEPT 2023
Google, Microsoft, Amazon and OpenAI are in an arms race to push [...]