Journal2025-07-25T13:43:09+00:00

First, do no harm.

1,500+ Posts…

Free knowledge sharing for Safe AI. Not for profit. Linkouts to sources provided. Ads are likely to appear on link-outs (zero benefit to this journal publisher)

Existential Risk Observatory. Does interpretability change everything? 07 OCT 2023.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Existential Risk Observatory. Does interpretability change everything? 07 OCT 2023. Significant news: two breakthroughs in interpretability, a subfield of AI Alignment, came out this week. What is AI Alignment again? AI Alignment [...]

THE WASHINGTON POST. ChatGPT provided better customer service than his staff. He fired them. Artificial intelligence is rapidly changing the world of customer service and call centers. Developing economies worry they’ll face the brunt. October 3, 2023.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. ChatGPT provided better customer service than his staff. He fired them. Artificial intelligence is rapidly changing the world of customer service and call centers. Developing economies worry they’ll face the brunt. By [...]

REPORT. Low-Resource Languages Jailbreak GPT-4. 03 OCT 2023.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Low-Resource Languages Jailbreak GPT-4 Abstract AI safety training and red-teaming of large language models (LLMs) are measures to mitigate the generation of unsafe content. Our work exposes the inherent cross-lingual vulnerability of [...]

Future of Life Institute. REGULATE AI NOW.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. "Lights out for all of us." [Translation: Death for all humanity.] TRANSCRIPT. In March 2023 an open letter sounded the alarm on the training of giant [...]

ANTHROPIC. Expanding access to safer AI with Amazon. Sep 25, 2023.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. ANTHROPIC. Expanding access to safer AI with Amazon. Sep 25, 2023. Today, we’re announcing that Amazon will invest up to $4 billion in Anthropic. The agreement is part of a broader collaboration [...]

Google Deepmind. Personality Traits in Large Language Models. 21 SEPT.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. A very good read from a respected source! Google Deepmind. Personality Traits in Large Language Models.  Personality Traits in Large Language Models Greg Serapio-Garc ́ıa,1,2,3† Mustafa Safdari,1† Cle [...]

OpenAI Red Teaming Network. OpenAI Announces an open call for the OpenAI Red Teaming Network and invites domain experts interested in improving the safety of OpenAI’s models to join the efforts. 19 SEPT 2023.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. OpenAI Red Teaming Network OpenAI Announces an open call for the OpenAI Red Teaming Network and invites domain experts interested in improving the safety of OpenAI’s models to join the efforts. Apply [...]

NYT. How to Tell if Your A.I. Is Conscious. 18 SEPT 2023.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. REPORT: Consciousness in Artificial Intelligence: Insights from the Science of Consciousness ABSTRACT.  Whether current or near-term AI systems could be conscious is a topic of scientific interest and increasing public concern. This [...]

Load More Posts
Go to Top