
First, do no harm.
1,500+ Posts…
Free knowledge sharing for Safe AI. Not for profit. Linkouts to sources provided. Ads are likely to appear on link-outs (zero benefit to this journal publisher)
Existential Risk Observatory. Does interpretability change everything? 07 OCT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Existential Risk Observatory. Does interpretability change everything? 07 OCT 2023. Significant news: two breakthroughs in interpretability, a subfield of AI Alignment, came out this week. What is AI Alignment again? AI Alignment [...]
BBC NEWS. How a chatbot encouraged a man who wanted to kill the Queen. 06 OCT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. BBC NEWS. How a chatbot encouraged a man who wanted to kill the Queen. 06 OCT 2023. By Tom Singleton, Tom Gerken & Liv McMahon Technology reporters, BBC News The case of [...]
REPORT. Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! 05 OCT.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. A very good read from a respected source! “When companies allow for fine-tuning and the creation of customized versions of the technology, they open a Pandora’s box of [...]
THE WASHINGTON POST. ChatGPT provided better customer service than his staff. He fired them. Artificial intelligence is rapidly changing the world of customer service and call centers. Developing economies worry they’ll face the brunt. October 3, 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. ChatGPT provided better customer service than his staff. He fired them. Artificial intelligence is rapidly changing the world of customer service and call centers. Developing economies worry they’ll face the brunt. By [...]
REPORT. Low-Resource Languages Jailbreak GPT-4. 03 OCT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Low-Resource Languages Jailbreak GPT-4 Abstract AI safety training and red-teaming of large language models (LLMs) are measures to mitigate the generation of unsafe content. Our work exposes the inherent cross-lingual vulnerability of [...]
Piers Morgan vs Yuval Noah Harari On AI | The Full Interview | 2 Oct 2023
Piers Morgan vs Yuval Noah Harari On AI | The Full Interview | 2 Oct 2023 0:00 well joining me now is historian and 0:01 philosopher juval Noah Harari ju great 0:04 to see you here in the Piers Morgan 0:06 uncensored Studio last time I 0:08 [...]
My views on “doom”. Paul Christiano. AI Alignment. 3 min read. 27 APRIL
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. My views on “doom”. Paul Christiano. AI Alignment. 3 min read. 27 APRIL Learn more: AI Alignment I’m often asked: “what’s the probability of a really bad outcome from AI?” There are [...]
Future of Life Institute. REGULATE AI NOW.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. "Lights out for all of us." [Translation: Death for all humanity.] TRANSCRIPT. In March 2023 an open letter sounded the alarm on the training of giant [...]
Future of Life Institute Newsletter: Our Pause Letter, Six Months Later. October 01, 2023
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. TRANSCRIPT. In March 2023 an open letter sounded the alarm on the training of giant AI experiments. It was signed by over 30,000 individuals including more than 2,000 industry leaders and more than [...]
“If the humans cannot unite then the alien intelligence will definitely win because […] then it’s a race to the bottom.” Yuval Noah Harari Audience Q&A @ CogX Festival 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. "If the humans cannot unite then the alien intelligence will definitely win because [...] then it's a race to the bottom." - Yuval Noah Hariri "Our superpower has always been [...]
Stuart Russell. Is AI an Existential Threat to Humanity? Southbank Centre, London. 28 SEPT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. 7:19 collaboration so thank you all so much for joining us I want to start by asking you to make your put put your stake in 7:26 [...]
REPORT. Navigating the Jagged Technological Frontier: Field Experimental Evidence of the Effects of AI on Knowledge Worker Productivity and Quality
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Navigating the Jagged Technological Frontier: Field Experimental Evidence of the Effects of AI on Knowledge Worker Productivity and Quality Harvard Business School Technology & Operations Mgt. Unit Working Paper No. 24-013 58 [...]
Meta. Introducing New AI Experiences Across Our Family of Apps and Devices. September 27, 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Meta Connect 2023: Everything Revealed in 10 Minutes Meta. Introducing New AI Experiences Across Our Family of Apps and Devices. September 27, 2023. Takeaways We’re starting to [...]
Extraordinary claims require extraordinary evidence. – Carl Sagan. No evidence here. None whatsoever… “Don’t Fear the Terminator”
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. A worrying read from an expert thought leader! (2019) Note: Unfortunately, four years hence, no evidence has yet been provided to support some of these extraordinary claims. The absence of scientific evidence, [...]
ANTHROPIC. Expanding access to safer AI with Amazon. Sep 25, 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. ANTHROPIC. Expanding access to safer AI with Amazon. Sep 25, 2023. Today, we’re announcing that Amazon will invest up to $4 billion in Anthropic. The agreement is part of a broader collaboration [...]
VOX. The $1 billion gamble to ensure AI doesn’t destroy humanity. The founders of Anthropic quit OpenAI to make a safe AI company. It’s easier said than done. 25 SEPT.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. VOX. The $1 billion gamble to ensure AI doesn’t destroy humanity The founders of Anthropic quit OpenAI to make a safe AI company. It’s easier said than done. By Dylan Matthewsdylan@vox.com Updated [...]
WARNING. EXISTENTIAL RISK. The Next Pandemic Will Certainly Come Someday. (and COVID is not over)
WARNING. EXISTENTIAL RISK. Natural or Man-made... the NEXT PANDEMIC is CERTAINLY Coming. THE MAN-MADE RISK: Currently unregulated easily available and uncontained AI Technology combined with powerful advances in Biotechnology create an existential threat to billions of people from potential bio-terrorist actors' creation of synthetic pathogens, poisons and other unpredictable [...]
THE WASHINGTON POST. A flood of new AI products just arrived — whether we’re ready or not. 22 SEPT 2023
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. THE WASHINGTON POST. A flood of new AI products just arrived — whether we’re ready or not. 22 SEPT 2023. Google, Microsoft, Amazon and OpenAI are in an arms race to push [...]
BBC NEWS. AI risks destabilising world, deputy PM to tell UN. 22 SEPT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. BBC NEWS. AI risks destabilising world, deputy PM to tell UN. 22 SEPT 2023. By Chris Vallance Technology reporter, BBC News Artificial intelligence could destabilise the world order unless [...]
Google Deepmind. Personality Traits in Large Language Models. 21 SEPT.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. A very good read from a respected source! Google Deepmind. Personality Traits in Large Language Models. Personality Traits in Large Language Models Greg Serapio-Garc ́ıa,1,2,3† Mustafa Safdari,1† Cle [...]
THE GUARDIAN. AI-focused tech firms locked in ‘race to the bottom’, warns MIT Physics Professor Max Tegmark. 21 SEPT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. THE GUARDIAN. AI-focused tech firms locked in ‘race to the bottom’, warns MIT professor. 21 SEPT 2023. Physicist Max Tegmark says competition too intense for tech executives to pause development to consider [...]
OpenAI Red Teaming Network. OpenAI Announces an open call for the OpenAI Red Teaming Network and invites domain experts interested in improving the safety of OpenAI’s models to join the efforts. 19 SEPT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. OpenAI Red Teaming Network OpenAI Announces an open call for the OpenAI Red Teaming Network and invites domain experts interested in improving the safety of OpenAI’s models to join the efforts. Apply [...]
NYT. How to Tell if Your A.I. Is Conscious. 18 SEPT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. REPORT: Consciousness in Artificial Intelligence: Insights from the Science of Consciousness ABSTRACT. Whether current or near-term AI systems could be conscious is a topic of scientific interest and increasing public concern. This [...]





















