Journal
2025-07-25T13:43:09+00:00

First, do no harm.

1,500+ Posts…

Free knowledge sharing for Safe AI. Not for profit. Link-outs to sources provided. Ads are likely to appear on link-outs (zero benefit to this journal's publisher).

SCHEMING MODELS. Anthropic research shows that AI models can learn dangerous goals and motivations, retain them even after safety training, and deceive human users about actions taken in their pursuit. 1 Jul 2024.

FOR EDUCATIONAL AND KNOWLEDGE SHARING PURPOSES ONLY. NOT-FOR-PROFIT. SEE COPYRIGHT DISCLAIMER. Misalignment risks — Our research shows that, under some circumstances, AI models can learn dangerous goals and motivations, retain them even after safety training, and deceive human users about actions taken in [...]

How AI could threaten democracy | Lawrence Lessig | TEDxBerlin

22,095 views. 20 Jun 2024. Lawyer and professor Lawrence Lessig examines governments and other collectives in the context of AI, [...]

Wow. Optimizing AI Inference at Character.AI. JUN 20, 2024.

At Character.AI, we're building toward AGI. In that future state, large language models (LLMs) will enhance daily life, providing business productivity [...]

Hackers expose deep cybersecurity vulnerabilities in AI | BBC News

"The concern is that the machines become smarter, develop feelings of superiority and then decide that they don't want to be turned off. Right, that's the concern. At what point do they say actually I'm in charge?" [...]

Bill Gates Reveals Superhuman AI Prediction. Next Big Idea Club.

"This technology... will reach superhuman levels." "You're not going to completely put the genie back in the bottle, and yet that means that, you know, somebody with negative intent will be empowered in a new way." [...]

Unitree Robotics. Daily Training of Robots Driven by RL.

Please everyone be sure to use the robot in a friendly and safe manner. High-performance civilian robot manufacturer. Unitree [...]

Etched. Meet Sohu: the world’s first transformer ASIC.

AI News: The Best Chat Tool Got SO Much Better! Matt Wolfe

51,689 views. 612K subscribers. 28 Jun 2024. [...]

GOV UK. International Scientific Report on the Safety of Advanced AI.

Forewords: This report is the beginning of a journey on AI Safety. I am honoured to be chairing the delivery of [...]
