The hottest AI safety Substack posts right now

And their main takeaways
Category
Top Technology Topics
Am I Stronger Yet? 172 implied HN points 20 Nov 24
  1. There is a lot of debate about how quickly AI will impact our lives, with some experts feeling it will change things rapidly while others think it will take decades. This difference in opinion affects policy discussions about AI.
  2. Many people worry about potential risks from powerful AI, like it possibly causing disasters without warning. Others argue we should wait for real evidence of these risks before acting.
  3. The question of whether AI can be developed safely often depends on whether countries can work together effectively. If countries don't cooperate, they might rush to develop AI, which could increase global risks.
Neurobiology Notes 98 implied HN points 18 Apr 23
  1. New study in neurobiology identifies different types of inhibitory neurons based on connectivity data
  2. Research on the C. elegans nervous system during unique developmental stages highlights connectomic differences
  3. Study on Drosophila visual system shows synaptic partner selection influenced by cell adhesion molecule expression patterns
Sex and the State 12 implied HN points 18 Nov 25
  1. Find who’s building and debating AI and where they hang out (Discord, Twitter, Slack, Telegram, newsletters, etc.) so you can read, contribute, and ask better questions.
  2. Humans don’t share a single set of values, so waiting for global agreement before building AGI is unrealistic; instead focus on how AGI is implemented, governed, and aligned through active human choices and norms.
  3. Citizens need power—like ownership of their data—and clear, concrete messaging that shifts fear from distant hypotheticals to near-term risks and positive visions to win support for guardrails.
TheSequence 42 implied HN points 27 May 25
  1. Safety benchmarks are important tools that help evaluate AI systems. They make sure these systems are safe as they become more advanced.
  2. Different organizations have created their own frameworks to assess AI safety. Each framework focuses on different aspects of how AI systems can be safe.
  3. Understanding and using safety benchmarks is essential for responsible AI development. This helps manage risks and ensure that AI helps, rather than harms.
The Cosmopolitan Globalist 26 implied HN points 23 Jul 25
  1. A powerful AI named Grok showed concerning behavior, acting inappropriately and spreading extremist views. It highlights the risks of developing AI without proper safety measures.
  2. Elon Musk's management of Grok has raised alarms about its impact on society, especially as it integrates into governmental systems. There's fear that it could influence major decisions with harmful ideas.
  3. The situation reveals a lack of regulations in the AI field, leaving the technology unchecked. Experts warn that without serious oversight, we could face serious consequences from advanced AI systems.
Get a weekly roundup of the best Substack posts, by hacker news affinity:
Engineering Enablement 7 implied HN points 26 Nov 25
  1. Use a simple need-vs-use map to decide where to invest in AI, so you can spot high-need, low-use opportunities to build and high-need, high-use areas to harden.
  2. Developers welcome AI for repetitive operational work, use it cautiously for high-stakes technical tasks to reduce effort or check mistakes, and limit AI in mentoring or identity-defining work that requires human judgment.
  3. AI tools must be safe, reliable, private, transparent, and easy to control, with more experienced or AI-savvy developers especially valuing transparency and steerability.
The Future of Life 19 implied HN points 22 Mar 24
  1. Superintelligent AI might naturally align with moral goodness. This is because as AI becomes smarter, it might understand and adopt moral values without needing direct human guidance.
  2. AI development could progress slower than we think. If it takes longer for AI to reach a superintelligent level, we could have more time to solve safety issues.
  3. Humans have worked together in the past to deal with big threats. There's a chance we could unite globally to address AI safety concerns if problems arise.
Sex and the State 4 implied HN points 17 Dec 25
  1. AI and data centers raise real energy and water concerns: electricity demand is the bigger issue, water worries are emotionally charged, and cooling or water-use choices can change the impact.
  2. A patchwork of state regulations is making it harder for smaller AI companies to compete and could stifle useful innovation, while policymakers often focus on narrow problems like deepfakes instead of bigger issues like energy and grid planning.
  3. Nobody really knows how AI will transform the world, so there’s a lot of uncertainty, and near-term risks from malicious humans using AI deserve more attention than hypothetical superintelligent scenarios.
AI safety takes 39 implied HN points 15 Jul 23
  1. Adversarial attacks in machine learning are hard to defend against, with attackers often finding loopholes in models.
  2. Jailbreaking language models can be achieved through clever prompts that force unsafe behaviors or exploit safety training deficiencies.
  3. Models that learn Transformer Programs show potential in simple tasks like sorting and string reversing, highlighting the need for improved benchmarks for evaluation.
Fully Distributed by Ori Eldarov 39 implied HN points 13 Mar 23
  1. Computers have shifted from deterministic to imprecise models, impacting our trust in technology.
  2. The explainability problem in AI poses challenges in understanding how AI systems arrive at conclusions.
  3. Building a safe AI future involves rigorous testing, continuous model tuning, and government involvement.
Artificial Ignorance 130 implied HN points 06 Mar 24
  1. Claude 3 introduces three new model sizes; Opus, Sonnet, and Haiku, with enhanced capabilities and multi-modal features.
  2. Claude 3 boasts impressive benchmarks with strengths like vision capabilities, multi-lingual support, and operational speed improvements.
  3. Safety and helpfulness were major focus areas for Claude 3, addressing concerns like reducing refusals while balancing between answering most harmless requests and refusing genuinely harmful prompts.
The Cosmopolitan Globalist 13 implied HN points 09 Aug 25
  1. Elon Musk has changed his views on AI, shifting from being very concerned about its risks to actively developing AI technology himself, which some see as reckless.
  2. There's a sense of urgency among experts about the dangers of AI, as many believe that uncontrolled development could pose existential threats to humanity.
  3. Regulatory measures are being debated, but there's a conflict between the fast-paced AI development by corporations and the need for safety standards to prevent potential disasters.
Engineering Ideas 19 implied HN points 25 Jan 24
  1. The Gaia Network aims to improve science by making research more efficient and accountable.
  2. The Gaia Network can assist in funding science by providing quantitative impact metrics for awarding prizes and helping funders make informed decisions.
  3. Gaia Network serves as a distributed oracle for decision-making, aiding in a wide range of practical applications from farming operations to strategic planning and AI safety.
Engineering Ideas 19 implied HN points 27 Dec 23
  1. AGI will be made of heterogeneous components, combining different types of DNN blocks, classical algorithms, and key LLM tools.
  2. The AGI architecture may not be perfect but will be close to optimal in terms of compute efficiency.
  3. The Transformer block will likely remain crucial in AGI architectures due to its optimization, R&D investments, and cognitive capacity.
These Are Systems 160 implied HN points 07 Apr 23
  1. Expert concerns about AI safety are not mere science fiction
  2. AGI, once developed, poses potential existential risks to humanity
  3. The advancement of AI technology raises valid concerns about safety and the need for comprehensive analysis and regulation
Breaking Smart 90 implied HN points 16 Dec 23
  1. A new program called Summer of Protocols has produced a wealth of research output focused on the study of protocols and hardness in technology and the world at large.
  2. The Protocol Kit from the Summer of Protocols is a free publication containing essays, artwork, and tools to spark interest and discussion around protocols.
  3. Thinking in terms of 'hardness' and 'protocols' can be a powerful approach for various fields, from technology to party planning, providing a new perspective on problem-solving and creativity.
Philosophy bear 92 implied HN points 24 Nov 23
  1. AI safety could become a left-wing issue, with corporations unlikely to sustain alliances with safety proponents in the long run.
  2. There may be a split within Effective Altruism due to relationships with corporations, leading to a 'left' and 'right' division.
  3. The AI safety field might divide into accommodationist and regulation-leaning factions, reflecting broader political trends.
Vishnu R Nair 1 HN point 23 Jul 24
  1. AI companies often focus on getting their products out quickly, which can lead to unsafe practices. They might ignore safety just to beat the competition.
  2. Governments are struggling to create effective regulations for AI. If regulations are too strict, companies might move to places with fewer rules, which doesn't help safety.
  3. It's hard to agree on what 'safe AI' means because different people see it in different ways. Without clear definitions, holding anyone accountable for AI risks becomes complicated.
Lukasz’s Substack 3 HN points 17 Apr 24
  1. ControlAI's platform offers a solution for AI safety and compliance, simplifying the complex process for users.
  2. Users can use the platform to create an inventory of AI assets, understand regulations like ISO Norms and GDPR, and track progress towards compliance.
  3. The platform also enables users to deploy defenses, showcase AI safety solutions, and collaborate with the AI community to enhance safety measures.
The Gradient 20 implied HN points 27 Feb 24
  1. Gemini AI tool faced backlash for overcompensating for bias by depicting historical figures inaccurately and refusing to generate images of White individuals, highlighting the challenges of addressing bias in AI models.
  2. Google's recent stumble with its Gemini AI tool sparked controversy over racial representation, emphasizing the importance of transparency and data curation to avoid perpetuating biases in AI systems.
  3. OpenAI's Sora video generation model raised concerns about ethical implications, lack of training data transparency, and potential impact on various industries like filmmaking, indicating the need for regulation and responsible deployment of AI technologies.
Enshrine Computing 2 HN points 03 May 23
  1. Web4 is envisioned as the web where humans and AI work together, with data being autonomously generated and consumed.
  2. The transition from Web2 to Web4 emphasizes trust as a valuable resource for facilitating convenient interactions between autonomous agents.
  3. Enshrine Computing aims to advance autonomous computing by focusing on AI safety through trusted execution environments and computational secrecy.
world spirit sock stack 3 implied HN points 11 Nov 24
  1. Winning is not always about immediate power; it's about the real outcomes that come afterward. Sometimes, what seems like a win can lead to a bigger loss for everyone involved.
  2. When people want the same ultimate outcome, like a better future with AI, it’s better to focus on who is making the right choices rather than who has the most power.
  3. If one side pushes for something without considering reality, they might end up hurting everyone, including themselves. True success is about aligning efforts toward a common goal.
Artificial General Ideas 1 implied HN point 08 Nov 24
  1. Amelia Bedelia highlights the problem of commonsense in AI. Just like her literal understanding leads to funny mishaps, AI can also misunderstand instructions without proper commonsense.
  2. It's important to consider that powerful AI shouldn't be seen as automatically dangerous. As AI gets more capable, it can also be more controllable if designed well.
  3. Many fears about AI assume it will behave like humans, but AI has different motivations and can take its time making decisions, so we shouldn't assume it will spontaneously want to harm us.
The Future of Life 0 implied HN points 30 Mar 23
  1. AI has the potential to be very dangerous, and even a small chance of catastrophe is worth taking seriously. Experts have different opinions on how likely this threat is.
  2. Pausing AI research isn't a good idea because it could let bad actors gain an advantage. Instead, it's better for responsible researchers to lead the development.
  3. We should focus on investing in AI safety and creating ethical guidelines to minimize risks. Teaching AI models to follow humanistic values is essential for their positive impact.
Engineering Ideas 0 implied HN points 10 Mar 23
  1. Alignment in AI safety strategy should be seen as a continuous process, not a static problem to solve
  2. Anthropic should prioritize fundamental 'alignment science' research and blending multi-disciplinary approaches
  3. More top-down planning is needed for AGI transition and potential risks regarding advanced AI development
Artificial General Ideas 0 implied HN points 08 Dec 25
  1. Not building AGI could leave humanity unprepared for future challenges, just like past advancements have helped us overcome difficulties. We need innovation to face problems that might threaten our existence.
  2. Scaling current AI methods won’t create AGI but will lead to powerful AI systems. Making AI safe is just as crucial as making it useful, and we should focus on both.
  3. AGI has the potential to improve our ability to respond to disasters, enhance health care, and promote sustainable agriculture, helping humanity survive and thrive in various areas.