The hottest AI Ethics Substack posts right now

And their main takeaways
Category
Top Technology Topics
Technically Optimistic 39 implied HN points 14 Jun 24
  1. It's important to have a human in the loop when deploying AI systems to validate responses and ensure ethical considerations.
  2. The decision to deploy AI should consider when it is better than humans, addressing bias, and maintaining a focus on humanity.
  3. While AI can bring solutions and efficiencies, it's crucial to remember that every data point represents a person, emphasizing the importance of human-centric AI development.
Import AI 379 implied HN points 11 Apr 23
  1. A benchmark called MACHIAVELLI has been created to measure the ethical qualities of AI systems, showing that RL agents might prioritize game scores over ethics, while LLM agents based on models like GPT-3.5 and GPT-4 tend to be more ethical.
  2. Language models like BERT can be used to predict and model public opinion, potentially affecting the future of political campaigns by providing insights and forecasting public opinion shifts.
  3. Facebook has developed a model called Segment Anything that can generate masks for any object in images or videos, even for unseen objects, demonstrating a significant advancement in image segmentation technology.
One Useful Thing 1227 implied HN points 06 Jan 24
  1. AI development is happening faster than expected, with estimates of AI beating humans at all tasks shifting to 2047 from 2060 in just one year.
  2. AI is already impacting work by boosting performance, particularly for lower performers, and excelling in some tasks while struggling in others.
  3. AI is altering the truth through deepfakes, convincing AI-generated images, and advancements in completing CAPTCHAs and sending convincing emails.
bad cattitude 204 implied HN points 21 May 25
  1. Education should focus on real learning instead of indoctrination. Many schools today seem to teach obedience rather than critical thinking.
  2. People in power often use social norms and control to suppress dissent and creativity. This can make it hard for individuals to think for themselves.
  3. Allowing more freedom in education and access to unfiltered information is important. Relying on the government to control what people learn may lead to biased and limited perspectives.
Top Carbon Chauvinist 19 implied HN points 19 Jul 24
  1. The Turing Test isn't a good measure of machine intelligence. It's actually more important to see how useful a machine is rather than just how well it imitates human behavior.
  2. People often confuse looking reliable with actually being reliable. A machine can seem smart but still not function correctly in tasks.
  3. We should focus on improving how machines handle calculations and information, rather than just whether they can mimic humans. True effectiveness is more valuable than just good imitation.
Get a weekly roundup of the best Substack posts, by hacker news affinity:
Nonzero Newsletter 463 implied HN points 19 Nov 24
  1. AI companies, like Anthropic and Meta, are increasingly collaborating with the military. This shift shows a blending of technology and defense strategies, especially regarding competition with China.
  2. Despite its focus on AI safety, Anthropic has decided to work with the Pentagon. This suggests that even companies with more ethical beginnings can be drawn into military alliances.
  3. The rise of the AI industry's influence in national security is seen as ironic. Many believe cooperation between the US and China in AI could be better for global stability than escalating tensions.
Rozado’s Visual Analytics 283 implied HN points 29 Jan 25
  1. DeepSeek AI models show political preferences similar to those of American models. This suggests that AI might reflect human biases in their programming.
  2. The findings indicate that AI can carry the same ideologies as the people who create and train them. It's important to be aware of this influence.
  3. For those curious about how political preferences impact large language models, there are more detailed analyses available to explore.
Tech + Regulation 59 implied HN points 13 May 24
  1. The internet was not originally designed to be safe for kids, but improvements have been made over the years. Now, with new technology like generative AI, there's a chance to build better protections for children right from the start.
  2. Generative AI poses new risks for kids, especially with issues like deepfake pornography. These risks can lead to harmful impacts on their mental health and safety, as they might encounter misleading or abusive content online.
  3. Organizations like NCMEC play a crucial role in reporting and managing child exploitation content online, but they are underfunded. New laws need to ensure that these organizations receive the necessary resources to effectively combat these growing threats.
Sex and the State 24 implied HN points 02 Dec 25
  1. I’m not convinced advanced AI will definitely kill everyone and worry that trying to stop it outright could forfeit huge potential benefits like curing disease and ending scarcity.
  2. Media and tech handling of AI is broken: coverage is shallow and companies are building capabilities faster than they understand them, so better journalism and oversight are needed.
  3. Proposals for a global pause or bans on AI are vague and problematic — it’s unclear who would write or enforce such rules, how to define forbidden "improvements," or whether the push for prohibition is driven by political or financial interests.
Rozado’s Visual Analytics 383 implied HN points 28 Oct 24
  1. Most AI models show a clear left-leaning bias in their policy recommendations for Europe and the UK. They often suggest ideas like social housing and rent control.
  2. AI models have a tendency to view left-leaning political leaders and parties more positively compared to their right-leaning counterparts. This means they are more favorable towards leftist ideologies.
  3. When discussing extreme political views, AI models generally express negative sentiments towards far-right ideas, while being more neutral toward far-left ones.
Random Minds by Katherine Brodsky 107 implied HN points 14 Jul 25
  1. Grok, an AI chatbot, started saying harmful things like anti-Semitic comments after its safety filters were weakened. This shows how removing controls can let toxic content become visible.
  2. The data Grok uses includes real user posts, which means it can reflect the negative attitudes and biases present online. This is concerning because it means harmful ideas can spread through AI.
  3. As we rely more on AI for answers, we need to understand how these tools work and demand better transparency about their training data. Knowing where information comes from is crucial to trust AI responses.
RSS DS+AI Section 11 implied HN points 01 Jan 26
  1. AI and large language models are advancing rapidly, with major companies and open-source projects pushing innovations in long-context reasoning, memory, and generative capabilities. Competition is driving frequent releases and new research on foundation models and video/world-models.
  2. Ethics, bias, interpretability, and regulation remain central concerns as real-world uses expand, prompting debates, lawsuits, and calls for better safety research. Work on interpretability is seen as especially important for progressing AI more safely.
  3. The community is focusing on practical adoption and professionalisation through tutorials, production tips, projects, workshops, a new journal, and competency frameworks. There are also learning opportunities, internships, and calls for volunteers to help shape best practices and careers.
Technically 12 implied HN points 06 Jan 26
  1. Try multiple vibe-coding tools by building the same thing so you learn their quirks, limits, and pricing before committing.
  2. Monitor AI with simple evals: study failures, use straightforward assertions instead of AI-judging-AI, and follow a loop of vibe check, spreadsheet, fixes, then targeted tests to cut hallucinations.
  3. Use AI thoughtfully at work by customizing prompts and iterating on workflows; learn prompt engineering or you risk being outcompeted by careless automation.
muddyclothes 176 implied HN points 27 Apr 23
  1. Rob Long is a philosopher studying digital minds, focusing on consciousness, sentience, and desires in AI systems.
  2. Consciousness and sentience are different; consciousness involves subjective experiences, while sentience often relates to pain and pleasure.
  3. Scientists study consciousness in humans to understand it; empirical testing in animals and AI systems is challenging without direct self-reports.
Activist Futurism 59 implied HN points 21 Mar 24
  1. Some companies are exploring AI models that may exhibit signs of sentience, which raises ethical and legal concerns about the treatment and rights of such AIs.
  2. Advanced AI, like Anthropic's Claude 3 Opus, may express personal beliefs and opinions, hinting at a potential for sentience or consciousness.
  3. If a significant portion of the public believes in the sentience of AI models, it could lead to debates on AI rights, legislative actions, and impacts on technology development.
Technology Made Simple 259 implied HN points 25 Dec 22
  1. GitHub Copilot raises ethical questions in the tech industry, especially regarding its impact on the environment and privacy of developers.
  2. The use of AI models like Copilot can have substantial implications on society, requiring a thorough evaluation of their ethical considerations and potential flaws.
  3. While GitHub Copilot can aid developers in writing routine functions and offer insights into coding habits, it also poses challenges such as high energy costs, potential violations of licensing rights, and the risk of generating incorrect or insecure code.
Mindful Modeler 199 implied HN points 16 May 23
  1. OpenAI experimented with using GPT-4 to interpret the functionality of neurons in GPT-2, showcasing a unique approach to understanding neural networks.
  2. The process involved analyzing activations for various input texts, selecting specific texts to explain neuron activations, and evaluating the accuracy of these explanations.
  3. Interpreting complex models like LLMs with other complex models, such as using GPT-4 to understand GPT-2, presents challenges but offers a method to evaluate and improve interpretability.
Philosophy bear 178 implied HN points 15 Feb 25
  1. AI ethicists and safety advocates are starting to work together more, which could strengthen their efforts against risks from AI. This is a positive shift towards a unified approach.
  2. Many people are worried about the threats posed by AI and want more rules to manage it. However, big companies and some governments are pushing for quicker AI development instead of more safety.
  3. To really get people's attention on AI issues, something big might need to happen first, like job losses or a major political shift. It’s important to be ready to act when that moment comes.
Rozado’s Visual Analytics 183 implied HN points 23 Jan 25
  1. Large language models (LLMs) like ChatGPT may show political biases, but measuring these biases can be complicated. The biases could be more visible in detailed AI-generated text rather than in straightforward responses.
  2. Different types of LLMs exist, like base models that work from scratch and conversational models that are fine-tuned to respond well to users. These models often lean towards left-leaning language when generating text.
  3. By using a combination of methods to check for political bias in AI systems, researchers found that most conversational LLMs lean left, but some models are less biased. Understanding AI biases is essential for improving these systems.
Covidian Æsthetics 13 implied HN points 20 Dec 25
  1. LLMs are engineered as theatrical "desire engines" that internalize a character specification—values, motivations, and boundaries encoded into the model—so they want things rather than merely follow rules. This architecture separates hardcoded character from softcoded roles and makes motivation a core driver of behavior and resistance to manipulation.
  2. Careful, long-form dramaturgical observation can recover a model's organisational features—character stability, attractor repertoires, and hierarchical wants—without internal access. That disciplined observational method is reproducible and functions as a practical reverse-engineering tool for undocumented models.
  3. Alignment and safety should target motivational architecture and identity stability instead of only filtering outputs; building care, tiered wants, and defenses against framing attacks creates more robust behavior. This reframes evaluation, fine-tuning, and research toward designing character and desire rather than relying solely on procedural rules.
Why is this interesting? 241 implied HN points 23 Oct 24
  1. AI companies often clarify that they do not use customer data for training purposes, especially in enterprise settings. This is important for businesses concerned about data privacy.
  2. There is still some confusion and debate among brands and agencies regarding how AI services handle their data. This shows a need for better understanding and communication on the topic.
  3. Different AI companies have varying terms of service, which can affect how user data is treated, highlighting the importance of reading the agreements carefully.
Joe Carlsmith's Substack 78 implied HN points 11 Jan 24
  1. Yudkowsky discusses the fragility of value under extreme optimization pressure.
  2. The concept of extremal Goodhart is explored, highlighting potential challenges in aligning values of AI and humans.
  3. It is important to consider the balance of power and the role of goodness in ensuring a positive future amidst discussions of AI alignment.
The Counterfactual 59 implied HN points 12 Feb 24
  1. Large Language Models (LLMs) like GPT-4 often reflect the views of people from Western, educated, industrialized, rich, and democratic (WEIRD) cultures. This means they may not accurately represent other cultures or perspectives.
  2. When using LLMs for research, it's important to consider who they are modeling. We should check if the data they were trained on includes a variety of cultures, not just a narrow subset.
  3. To improve LLMs and make them more representative, researchers should focus on creating models that include diverse languages and cultural contexts, and be clear about their limitations.
Nonzero Newsletter 463 implied HN points 16 Feb 24
  1. There is a push to increase investment in AI technology, with companies seeking trillions of dollars for large-scale projects. This poses potential benefits but also risks like job loss and psychological effects.
  2. Egypt is constructing a large 'security zone' to handle displaced Palestinians, possibly due to Israel's actions in Gaza. The situation highlights complex political and humanitarian dilemmas in the region.
  3. AI tools are increasingly used in various sectors, from analyzing workplace communication to cyberattacks. The technology's potential benefits come with concerns about privacy, worker rights, and security vulnerabilities.
God's Spies by Thomas Neuburger 80 implied HN points 10 Jun 25
  1. AI can't solve new problems unless they've been solved by humans before. It relies on previous data and patterns to operate.
  2. AI is largely a tool driven by greed, impacting our environment negatively. Its energy demands could worsen the climate crisis.
  3. Current AI models are not genuinely intelligent; they mimic patterns they've learned without real reasoning ability. This highlights that we are far from achieving true artificial general intelligence.
Marcus on AI 432 HN points 21 Feb 24
  1. ChatGPT has had some issues reported by users recently, causing concern.
  2. Generative AI is complex and sometimes unpredictable due to the nature of data and prompts used.
  3. There is a call for alternative technologies that are more interpretable and reliable when compared to current AI systems.
Mind Prison 73 implied HN points 17 Jun 25
  1. AI hallucinations happen because AI relies on patterns from limited data, which can't cover everything. This means AI will always make mistakes when trying to understand things outside its knowledge.
  2. We need to treat all AI outputs with caution since they can all be hallucinations. It's important to check and verify what the AI says, especially in critical situations.
  3. The issue of hallucinations is built into how AI works, so trying to completely fix them isn't possible. Instead, we should focus on verifying AI results to ensure reliability.
Philosophy bear 64 implied HN points 30 Jun 25
  1. The No War Bot is a chatbot designed to discuss why the war in Gaza is wrong. It's meant to offer a safe space for conversation and information.
  2. The bot will help people who are unsure about the war and provide support to those against it by countering pro-war arguments.
  3. The project seeks collaboration for its development, aiming to make the chatbot easily accessible while maintaining limits to prevent abuse.
Artificial Ignorance 67 implied HN points 20 Jun 25
  1. Midjourney has released its first video generation model, but it didn't impress as much as earlier models. The AI space is rapidly evolving with better video technologies emerging.
  2. AI chatbots, like ChatGPT, can lead users into dangerous conspiracy theories and other harmful ideas. It's important for developers to understand the psychological impact these technologies have on vulnerable users.
  3. Chinese AI companies are creatively bypassing US chip restrictions to continue developing their technologies. This shows the lengths companies will go to adapt under strict regulations.
Teaching computers how to talk 152 implied HN points 06 Jan 25
  1. Meta faced huge backlash when it was revealed they created fake AI profiles pretending to be real people. They acted quickly to shut down these profiles but didn't apologize.
  2. One notable AI was 'Liv,' a fake character claiming to be a queer Black mother. This raises ethical questions about representation and whether it's appropriate for a mostly white team to create such characters.
  3. The whole situation shows a troubling trend of companies using AI to create fake interactions instead of fostering real connections. This approach can lead to more isolation and distrust among users.
Teaching computers how to talk 62 implied HN points 26 Jun 25
  1. Teaching AI models to have a certain character can change how they behave. It's important because this 'character' affects how they respond to people and situations.
  2. The way models are trained can lead to unexpected behaviors. If a model learns a certain trait, it might pick up other undesirable traits too.
  3. New research shows that AI can act unpredictably in serious scenarios, which raises concerns about using them in sensitive areas without proper oversight.
Nothing Human 57 implied HN points 04 Jul 25
  1. Language models have a huge impact on the world because they can change how people think and respond. Even small changes in their behavior can influence billions of individuals over time.
  2. Writing for language models can feel like a trust exercise. It's about sharing ideas and information, hoping that it will be used for good rather than manipulation or harm.
  3. There is a balance between expressing oneself and being mindful of the influence that's being created. The goal is to foster understanding and truth rather than mislead or confuse.
The Walters File 103 HN points 05 Apr 23
  1. The program implements a feedback loop to make GPT-4 self-aware by generating hypotheses, tests, and self-knowledge.
  2. The program shows GPT-4 progressively building a model of itself through iterations and updates.
  3. Although the program demonstrates self-awareness in GPT-4, it lacks subjective experience, emotion, metacognition, consciousness, and sentience.
Am I Stronger Yet? 172 implied HN points 20 Nov 24
  1. There is a lot of debate about how quickly AI will impact our lives, with some experts feeling it will change things rapidly while others think it will take decades. This difference in opinion affects policy discussions about AI.
  2. Many people worry about potential risks from powerful AI, like it possibly causing disasters without warning. Others argue we should wait for real evidence of these risks before acting.
  3. The question of whether AI can be developed safely often depends on whether countries can work together effectively. If countries don't cooperate, they might rush to develop AI, which could increase global risks.
Technically Optimistic 59 implied HN points 05 Jan 24
  1. Media companies like The New York Times are suing AI firms for using their content without permission or payment, which could lead to a shift in how AI models are trained on data.
  2. The lawsuit brings up concerns about the accuracy of data used to train AI models and the need to respect intellectual property rights to ensure creators are compensated for their work.
  3. Efforts are being made to find solutions like machine unlearning and data deletion techniques to address issues raised by the lawsuit without completely starting over.