Journal
2025-07-25T13:43:09+00:00

First, do no harm.

1,500+ Posts…

Free knowledge sharing for Safe AI. Not for profit. Link-outs to sources are provided. Ads are likely to appear on link-outs (zero benefit to this journal's publisher).

VICE. Scientists Increasingly Can’t Explain How AI Works. 01 November 2022.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. AI researchers are warning developers to focus more on how and why a system produces certain results than the fact that the system can [...]

WSJ. Making Medical Science More Democratic. Patient groups are collaborating with medical researchers as never before, creating a new model for progress against long Covid and other diseases. 10 FEB 2023


Ryan Reynolds. ChatGPT Writes a Mint Mobile Ad. 10 JAN 2023.

Learn more: CNBC. How the generative A.I. boom could forever [...]

1984 by George Orwell | Lex Fridman

"The way out to me, and the takeaway from this book, the way out is love." --- Lex Fridman "Love for other human beings, love for life itself. That's the little flame from which hope springs." --- Lex Fridman [...]

ANTHROPIC. Constitutional AI: Harmlessness from AI Feedback. Dec 15, 2022

Abstract: As AI systems become more capable, we would like to enlist their help to supervise other AIs. We experiment with methods [...]

GPAI. The Global Partnership on Artificial Intelligence

The Global Partnership on Artificial Intelligence (GPAI) is where governments and leading AI experts work together on values-based pathways for AI. It is a multi-stakeholder initiative which aims to bridge the gap between theory and practice. Transcript: Artificial intelligence is [...]

ANTHROPIC. Predictability and Surprise in Large Generative Models. 03 OCT 2023

Authors: Deep Ganguli, Danny Hernandez, Liane Lovitt, Nova DasSarma, Tom Henighan, Andy Jones, Nicholas Joseph, Jackson Kernion, Ben Mann, Amanda Askell, Yuntao Bai, Anna Chen, Tom [...]

GROKKING. NEW PHENOMENON. A Mechanistic Interpretability Analysis of Grokking. Models trained on small algorithmic tasks like modular addition will initially memorise the training data, but after a long time will suddenly learn to generalise to unseen data.

Grokking is the mysterious phenomenon of explosive machine learning. Learn more: QUICK STUDY on Twitter. A Mechanistic Interpretability Analysis of Grokking, by Neel Nanda and Tom Lieberum. 15th Aug 2022. Introduction: Grokking is a recent phenomenon discovered by OpenAI researchers that, in my opinion, is one of the most fascinating mysteries [...]

2022 Expert Survey on Progress in AI. 03 AUGUST 2022

48% of respondents gave at least a 10% chance of an extremely bad outcome. We contacted approximately 4271 researchers who published at the conferences NeurIPS or ICML in 2021. These people were selected by taking all of the authors at those [...]
