Journal2025-07-25T13:43:09+00:00

First, do no harm.

1,500+ Posts…

Free knowledge sharing for Safe AI. Not for profit. Linkouts to sources provided. Ads are likely to appear on link-outs (zero benefit to this journal publisher)

ANTHROPIC. Constitutional AI: Harmlessness from AI Feedback. Dec 15, 2022

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. ANTHROPIC. Constitutional AI: Harmlessness from AI Feedback. Dec 15, 2022 Abstract As AI systems become more capable, we would like to enlist their help to supervise other AIs. We experiment with methods [...]

GPAI. The Global Partnership on Artificial Intelligence

 The Global Partnership on Artificial Intelligence (GPAI) is where governments and leading AI experts work together on values-based pathways for AI. It is a multi-stakeholder initiative which aims to bridge the gap between theory and practice. TRANSCRIPT. Artificial intelligence is [...]

ANTHROPIC. Predictability and Surprise in Large Generative Models. 03 OCT 2023

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Predictability and Surprise in Large Generative Models DEEP GANGULI∗, DANNY HERNANDEZ∗, LIANE LOVITT∗, NOVA DASSARMA†, TOM HENIGHAN†, ANDY JONES†, NICHOLAS JOSEPH†, JACKSON KERNION†, BEN MANN†, AMANDA ASKELL, YUNTAO BAI, ANNA CHEN, TOM [...]

GROKKING. NEW PHENOMENA. A Mechanistic Interpretability Analysis of Grokking. Models trained on small algorithmic tasks like modular addition will initially memorise the training data, but after a long time will suddenly learn to generalise to unseen data.

Grokking is the mysterious phenomenon of explosive machine learning. Learn more: QUICK STUDY on Twitter. A Mechanistic Interpretability Analysis of Grokking by Neel Nanda, Tom Lieberum. 15th Aug 2022 Introduction Grokking is a recent phenomena discovered by OpenAI researchers, that in my opinion is one of the most fascinating mysteries [...]

2022 Expert Survey on Progress in AI. 03 AUGUST 2022

2022 Expert Survey on Progress in AI. 03 AUGUST 2022 48% of respondents gave at least 10% chance of an extremely bad outcome We contacted approximately 4271 researchers who published at the conferences NeurIPS or ICML in 2021. These people were selected by taking all of the authors at those [...]

THE WHITE HOUSE. Cancer Moonshot

THE WHITE HOUSE. Cancer Moonshot CANCER MOONSHOT Share Your Actions Share Your Stories & Ideas Your Stories Events and Webinars Fact Sheets Fact Sheet: President Biden Reignites Cancer Moonshot to End Cancer as We Know It Biden-Harris Administration Sets Goal of Reducing Cancer [...]

Future of Life Institute. Slaughterbots – if human: kill()

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. This could be our future, unless we agree to prohibit lethal autonomous weapons. To learn what Slaughterbots are, how they pose a risk, and what solutions are available, visit [...]

Final Report. National Security Commission on Artificial Intelligence (NSCAI)

Final Report. National Security Commission on Artificial Intelligence (NSCAI) Executive Summary The Final Report presents the NSCAI’s strategy for winning the artificial intelligence era. The 16 chapters explain the steps the United States must take to responsibly use AI for national security and defense, defend [...]

Kubeflow. MLOps open-source tool by Google on top of Kubernetes

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Kubeflow. MLOps open-source tool by Google on top of Kubernetes Almost immediately after Kubernetes established itself as the standard for working with a cluster of containers, Google created [...]

Zoom In: An Introduction to Circuits. Published by OpenAI. March 10, 2020

Zoom In: An Introduction to Circuits By studying the connections between neurons, we can find meaningful algorithms in the weights of neural networks - OpenAI. March 10, 2020 Zoom In: An Introduction to Circuits By studying the connections between neurons, we can find meaningful algorithms in the weights of neural [...]

The Bitter Lesson. Rich Sutton. March 13, 2019.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. The Bitter Lesson. Rich Sutton. March 13, 2019. The biggest lesson that can be read from 70 years of AI research is that general methods that leverage computation are ultimately the most [...]

Load More Posts
Go to Top