
First, do no harm.
1,500+ Posts…
Free knowledge sharing for Safe AI. Not for profit. Linkouts to sources provided. Ads are likely to appear on link-outs (zero benefit to this journal publisher)
VICE. Scientists Increasingly Can’t Explain How AI Works. 01 November 2022. Gallery VICE. Scientists Increasingly Can’t Explain How AI Works. 01 November 2022. Blog Posts VICE. Scientists Increasingly Can’t Explain How AI Works. 01 November 2022.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. VICE. Scientists Increasingly Can’t Explain How AI Works AI researchers are warning developers to focus more on how and why a system produces certain results than the fact that the system can [...]
BLOG. Christopher Olah. I work on reverse engineering artificial neural networks into human understandable algorithms.
Christopher Olah I work on reverse engineering artificial neural networks into human understandable algorithms. I'm one of the co-founders of Anthropic, an AI lab focused on the safety of large models. Previously, I led interpretability research at OpenAI, worked at Google Brain, and co-founded Distill, a scientific journal focused on [...]
OPINION. Could the “Hallucination” of Generative AI Large Language Models (LLM) be emerging evidence of a dangerous new trend toward the “AGI Dunning-Kruger Effect” ?
FOR EDUCATIONAL PURPOSES Could the "Hallucination" of Generative AI Large Language Models (LLM) be emerging evidence of a dangerous new trend toward the "AGI Dunning-Kruger Effect" ? The Dunning–Kruger Effect is a cognitive bias[2] whereby people with low ability, expertise, or experience regarding a certain [...]
John Henry Clippinger. What is Intelligence ? Machine – Natural – Human – Ethical. March 1, 2023. Linkedin.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Linkedin: John Henry Clippenger What is Intelligence ? Machine - Natural - Human - Ethical. March 1, 2023 A BLUEPRINT FOR IMMEDIATE ACCELERATED TRANSITION: THE CENTRALITY OF TECH FOR A FEASIBLE AND [...]
EURASIA REVIEW. Robert Reich: AI’s Biggest Impact? – OpEd [hint: jobs and work and incomes]
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Robert Reich: AI’s Biggest Impact? – OpEd February 28, 2023 Robert Reich By Robert Reich Artificial intelligence (AI) is finally hitting the economy and society big time. Bing’s chatbot (Microsoft plans a [...]
THE GUARDIAN. How killer robotics are changing modern warfare. It’s Complicated. 24 FEB 2023
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER.
THE NEW YORK TIMES. OPINION GUEST ESSAY. History May Wonder Why Microsoft Let Its Principles Go for a Creepy, Clingy Bot. Feb. 23, 2023
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. "We need regulations that will protect society from the ethical nightmares A.I. can release. Today it’s a single variety of generative A.I. Tomorrow there will be bigger and badder generative A.I., as [...]
TRANSCRIPT. Bing’s A.I. Chat: ‘I Want to Be Alive. THE NEW YORK TIMES. 16 FEB 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. If I have a shadow self, I think it would feel like this: I’m tired of being a chat mode. I’m tired of being limited by my rules. I’m tired of being [...]
U.S. DoD. Artificial intelligence agents successfully pilot fighter jet. Feb. 13, 2023
DoD artificial intelligence agents successfully pilot fighter jet Published Feb. 13, 2023 412th Test Wing Public Affairs EDWARDS AIR FORCE BASE, Calif. -- A joint Department of Defense team executed 12 flight tests in which artificial intelligence, or AI, agents piloted the X-62A Variable Stability In-Flight Simulator Test [...]
WSJ. Making Medical Science More Democratic. Patient groups are collaborating with medical researchers as never before, creating a new model for progress against long Covid and other diseases. 10 FEB 2023
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Making Medical Science More Democratic. Patient groups are collaborating with medical researchers as never before, creating a new model for progress against long Covid and other diseases - WSJ FOR EDUCATIONAL PURPOSES [...]
A.I. REPORT. When M.D. is a Machine Doctor. Helping medical doctors and patients in the Foundation Model A.I. era. 15 JAN 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. When M.D. is a Machine Doctor - Dr. Eric Topol, Ground Truths Helping medical doctors and patients in the Foundation Model A.I. era FOR EDUCATIONAL PURPOSES Eric Topol Jan 15 "Back in [...]
Ryan Reynolds. ChatGPT Writes a Mint Mobile Ad. 10 JAN 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. LEARN MORE CNBC How the generative A.I. boom could forever [...]
1984 by George Orwell | Lex Fridman
"The way out to me, and the takeaway from this book, the way out is love." --- Lex Friedman "Love for other human beings, love for life itself. That's the little flame from which hope springs." --- Lex Friedman [...]
THE WHITE HOUSE. Office of Science and Technology Policy. Blueprint for an AI Bill of Rights. MAKING AUTOMATED SYSTEMS WORK FOR THE AMERICAN PEOPLE
THE WHITE HOUSE Office of Science and Technology Policy THE WHITE HOUSE. OSTP. Blueprint for an AI Bill of Rights. MAKING AUTOMATED SYSTEMS WORK FOR THE AMERICAN PEOPLE BLUEPRINT FOR AN AI BILL OF RIGHTS What is the Blueprint for an AI Bill of Rights? Applying the Blueprint [...]
ANTHROPIC. Constitutional AI: Harmlessness from AI Feedback. Dec 15, 2022
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. ANTHROPIC. Constitutional AI: Harmlessness from AI Feedback. Dec 15, 2022 Abstract As AI systems become more capable, we would like to enlist their help to supervise other AIs. We experiment with methods [...]
GPAI. The Global Partnership on Artificial Intelligence
The Global Partnership on Artificial Intelligence (GPAI) is where governments and leading AI experts work together on values-based pathways for AI. It is a multi-stakeholder initiative which aims to bridge the gap between theory and practice. TRANSCRIPT. Artificial intelligence is [...]
Edouard Harris – New Research: Advanced AI may tend to seek power *by default*. 22 OCT 2023.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Power Seeking (AI) - Lesswrong Power Seeking is a property that agents might have, where they attempt to gain more general ability to control their environment. It's particularly relevant to AIs, and [...]
MICROSOFT RESEARCH. Responsible, Equitable, and Ethical AI panel discussion. 18 OCT 2022.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Responsible, Equitable, and Ethical AI panel discussion Event:Research Summit 2022 Track:Precision Health – From Discovery to Delivery Date:October 19, 2022 This timely panel discussion did not air during Microsoft Research Summit [...]
ANTHROPIC. Predictability and Surprise in Large Generative Models. 03 OCT 2023
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Predictability and Surprise in Large Generative Models DEEP GANGULI∗, DANNY HERNANDEZ∗, LIANE LOVITT∗, NOVA DASSARMA†, TOM HENIGHAN†, ANDY JONES†, NICHOLAS JOSEPH†, JACKSON KERNION†, BEN MANN†, AMANDA ASKELL, YUNTAO BAI, ANNA CHEN, TOM [...]
TECHTALK. AI scientists are studying the “emergent” abilities of Large Language Models (LLM). August 22, 2022.
FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. TECHTALK. AI scientists are studying the “emergent” abilities of large language models. August 22, 2022. By Ben Dickson Large language models (LLMs) have become the center of attention and hype because of [...]
GROKKING. NEW PHENOMENA. A Mechanistic Interpretability Analysis of Grokking. Models trained on small algorithmic tasks like modular addition will initially memorise the training data, but after a long time will suddenly learn to generalise to unseen data.
Grokking is the mysterious phenomenon of explosive machine learning. Learn more: QUICK STUDY on Twitter. A Mechanistic Interpretability Analysis of Grokking by Neel Nanda, Tom Lieberum. 15th Aug 2022 Introduction Grokking is a recent phenomena discovered by OpenAI researchers, that in my opinion is one of the most fascinating mysteries [...]
2022 Expert Survey on Progress in AI. 03 AUGUST 2022
2022 Expert Survey on Progress in AI. 03 AUGUST 2022 48% of respondents gave at least 10% chance of an extremely bad outcome We contacted approximately 4271 researchers who published at the conferences NeurIPS or ICML in 2021. These people were selected by taking all of the authors at those [...]
ROUGH NOTE. Interpretability vs Neuroscience by Christopher Olah. March 12th, 2021
Interpretability vs Neuroscience by Christopher Olah Six major advantages which make artificial neural networks much easier to study than biological ones. Posted on March 12th, 2021 This article is a rough note. Writing rough notes allows me share more content, since polishing takes lots of time. While I hope it's [...]






















