The hottest Data Substack posts right now

And their main takeaways

AI, Secrets, and 38TB

aidaily • 19 implied HN points • 21 Sep 23

🕹 Technology AI Data Marketing Tools Business

AI technology can help keep cities clean by identifying litter hotspots
Microsoft's AI researchers accidentally exposed 38 terabytes of private data on GitHub
Adding a human touch to business strategies can be beneficial alongside AI

Incorrectness Cascades - Three small follow-ups

From AI to ZI • 19 implied HN points • 10 May 23

🕹 Technology Research Models Analysis Data Experiment

Testing higher X values for more insights.
GPT-4 is faster but less safe in producing incorrect answers.
Analyzing model accuracy based on different questions reveals intriguing patterns.

Large language models in security

Davis Treybig • 19 implied HN points • 15 Apr 23

🕹 Technology Security AI Analysis Automation Data

Large language models (LLMs) are being used in security for tasks like log analysis and incident response.
LLMs are changing the landscape of traditional static analysis tools in cloud and application security.
LLMs have the potential to automate processes like vendor security questionnaires and enhance engineer-oriented security workflows.

GPT-4, Multimodal?

Sector 6 | The Newsletter of AIM • 19 implied HN points • 03 Aug 23

🕹 Technology AI Gadgets Software Innovation Data

OpenAI is moving quickly to develop GPT-5, but there are concerns about the features of GPT-4, especially its promised multimodal capabilities.
When GPT-4 was launched, it was said to include advanced image input options through a partnership, but these features are still not widely available.
Currently, the multimodal features of GPT-4 are limited and not accessible through the usual API, leaving users wanting more updates and access.

Using context-consideration framework for EdTech AI products

Knowledge Shots • 19 implied HN points • 29 Apr 23

🕹 Technology EdTech AI Market fit Models Data

AI products with strong market fit take both context and user considerations into account in decision-making.
Niche targeting is key for AI products in EdTech, focusing on specific user personas.
Access to specific 'graduate education data' can create a strong technological advantage in the market.

Algorithmic Bitcoin Mining

nicosmid • 19 implied HN points • 04 Apr 23

🕹 Technology Bitcoin Mining Data Efficiency Automation

Algorithmic Bitcoin mining automates tasks for increased efficiency and less human intervention.
Real-time data is used to optimize mining operations.
Benefits of algorithmic mining include efficiency, reduced energy consumption, and improved hardware health.

How I started with the Power Platform

Power Platform News • 19 implied HN points • 05 Aug 23

🕹 Technology Development Applications Microsoft Data Automation

The author shares their journey of building a Power Apps application to track multiple sites and divisions, showcasing how Power Platform simplifies complex tasks.
They found Microsoft Power Apps to be a user-friendly way to build applications, especially when integrated with SharePoint Online lists for data management.
By optimizing user onboarding through automation with Power Automate, they efficiently scaled to onboard 700 users in a week, demonstrating the importance of streamlining processes for user support.

The Ball’s in Your Court

Sector 6 | The Newsletter of AIM • 19 implied HN points • 11 Aug 23

🕹 Technology AI Data Chips Copyright Big Tech

Big Tech companies are finding clever ways to use internet data for their AI projects, even with new copyright laws in place.
Semiconductor companies are developing chips specifically for the Chinese market that almost meet US rules, showing a creative approach to regulations.
Crawlers such as Googlebot and GPTBot access online content unless website owners explicitly opt out, which raises questions about how that data is used for generative AI.
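
The opt-out mentioned in the last takeaway is usually expressed in a site's robots.txt file. Below is a minimal sketch, assuming a site that wants to refuse the GPTBot crawler while leaving other crawlers alone. The example.com URL and the OtherBot name are placeholders, and Python's standard urllib.robotparser is used only to check how such rules would be read:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: refuse GPTBot, say nothing about other crawlers.
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "https://example.com/some-post"  # placeholder URL
    print("GPTBot allowed:", is_allowed("GPTBot", page))
    print("OtherBot allowed:", is_allowed("OtherBot", page))
```

robots.txt is advisory: it only affects crawlers that choose to honor it, which is part of why the data-usage question stays open.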

Getting IBM X-Force Exchange Threat Intelligence TAXII Service Information for Use with Microsoft Sentinel

Rod’s Blog • 19 implied HN points • 11 Apr 23

🕹 Technology Security Software Data

To access IBM X-Force Exchange Threat Intelligence for Microsoft Sentinel, create an account at exchange.xforce.ibmcloud.com and retrieve an API key and password.
Once you have the API info, input it in the provided areas on the IBM X-Force Exchange API Docs page.
To use the Threat Intelligence - TAXII connector in Microsoft Sentinel, provide your API information and use the curl utility to list the available Collection IDs.
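
The curl step in the last takeaway amounts to an authenticated GET against a TAXII server's collections endpoint. Here is a rough sketch of the same request in Python, assuming TAXII 2.1 conventions (collections listed under `<api-root>/collections/`, HTTP Basic authentication, and the `application/taxii+json;version=2.1` media type). The API root below is a placeholder, not IBM's actual address, and the credentials are dummies; check the provider's documentation for the real values and TAXII version:

```python
import base64
import json
import urllib.request

# Assumption: the server speaks TAXII 2.1; older servers use a different media type.
TAXII_ACCEPT = "application/taxii+json;version=2.1"


def basic_auth_header(api_key: str, password: str) -> str:
    """Build the value of an HTTP Basic Authorization header."""
    raw = f"{api_key}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")


def build_collections_request(api_root: str, api_key: str, password: str):
    """Prepare (but do not send) a GET request for the collections list."""
    url = api_root.rstrip("/") + "/collections/"
    headers = {
        "Accept": TAXII_ACCEPT,
        "Authorization": basic_auth_header(api_key, password),
    }
    return urllib.request.Request(url, headers=headers)


def collection_ids(response_body: str) -> list:
    """Extract the Collection IDs from a TAXII collections response."""
    payload = json.loads(response_body)
    return [item["id"] for item in payload.get("collections", [])]


if __name__ == "__main__":
    # Placeholder API root and dummy credentials; nothing is sent over the network.
    request = build_collections_request(
        "https://taxii.example.com/api", "my-api-key", "my-password"
    )
    print(request.full_url)
```

Sending the request with `urllib.request.urlopen(request)` and passing the decoded body to `collection_ids` would yield the IDs that the Sentinel connector asks for.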

Whale Songs Explained

David’s Substack • 19 implied HN points • 03 Oct 23

🕹 Technology Privacy Blockchain Cryptography Security Data

Whale Songs allows anonymous tweeting from accounts with $1M in on-chain assets
Spartan-ECDSA is an important tool for zero-knowledge proof circuits
Challenges include handling large datasets, computationally intensive processes, and server limitations

The Improvisational Art of Innovation: How Jazz, Comedy, and Intuition Can Transform Business and Life

⭐️Bob’s Newsletter • 19 implied HN points • 02 Apr 23

💼 Business Innovation Creativity Data Intuition Leadership

Trust intuition more than data to drive true innovation and creativity.
Data sets the foundation, but creativity and intuition transform it into innovation.
Embrace curiosity, diversity of thought, and active listening to unlock potential for innovative problem-solving.

Personalization Uncovered: The Surprising Ways Hyper-Targeted Advertising Influences Our Thoughts, Choices, and Lives

⭐️Bob’s Newsletter • 19 implied HN points • 30 Mar 23

🕹 Technology Advertising Data Privacy Personalization

Hyper-targeted advertising limits exposure to new ideas and unexpected discoveries.
Personalized advertising may give a false sense of autonomy in our consumption choices.
Repetitive hyper-targeted ads can diminish real emotional connections with brands and others.

Are You Living in Musk's Simulation?

Sector 6 | The Newsletter of AIM • 19 implied HN points • 07 Aug 23

🕹 Technology AI Software Innovation Data Internet

Elon Musk recently acquired a key domain, ai.com, which might shape the future of AI significantly. Controlling AI means having a major influence on global events.
There's a lot of discussion around whether we could be living in a simulation, and Musk has jokingly suggested avoiding those talks in casual settings.
Many believe that whoever controls AI technology controls important aspects of society, which raises questions about power and responsibility.

How do I evaluate LLM coding agents? 🧑‍💻

LLMs for Engineers • 19 implied HN points • 31 Aug 23

🕹 Technology AI Software Development Data Engineering

LLM coding agents have advanced from simple code completion to creating entire code repositories. This shows how technology is evolving to assist with more complex software development tasks.
Evaluating these agents relies on benchmarks like HumanEval and MBPP, which test their coding accuracy. These tests are important to see how well the agents are performing.
While there are new tools and benchmarks for LLM coding agents, users might still need to create specific evaluations for their own needs to get the best results. It's essential to tailor assessments to fit unique projects.
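
Benchmarks of this kind are commonly scored with pass@k: the probability that at least one of k sampled solutions passes the problem's unit tests. The sketch below uses the standard unbiased estimator, 1 - C(n-c, k) / C(n, k), for n samples of which c are correct. The sample counts in the usage lines are made up for illustration and do not come from the post:

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k from n generated samples, c of which passed the tests."""
    if not 0 <= c <= n:
        raise ValueError("c must be between 0 and n")
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and n")
    # Chance that k samples drawn without replacement are all failures.
    all_fail = comb(n - c, k) / comb(n, k)
    return 1.0 - all_fail


if __name__ == "__main__":
    # Illustrative numbers only: 10 samples for one problem, 3 of them correct.
    for k in (1, 5, 10):
        print(f"pass@{k} =", round(pass_at_k(10, 3, k), 4))
```

A project-specific evaluation can reuse the same estimator: swap in your own test suite to decide which samples count as correct, then average the per-problem estimates.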

ChatGPT Dead?

Sector 6 | The Newsletter of AIM • 19 implied HN points • 17 Aug 23

🕹 Technology AI Software Data Innovation Trends

OpenAI might stop ChatGPT soon because of certain challenges. It's not definite yet, but it's a possibility worth considering.
Google is working on a new AI called Gemini, which they say will be better than ChatGPT. This adds pressure on OpenAI as they can't use user data as freely for updates.
Microsoft seems to be inactive in this race, just watching the developments happen without actively participating.

How to NOT ChatGPT; Three Data Point Thursday #86

Three Data Point Thursday • 19 implied HN points • 09 Mar 23

🕹 Technology Data Investment Research Open Source AI

Investment is flowing mainly into products with limited future potential, with a focus on generative AI and startups.
Germany is conducting more data mesh research, indicating a growing interest in the field.
Consider investing time in open-source data startups, as they show promise for growth and success.

JPMorgan's AI Chatbot to Replace Research Analysts 😳

Sector 6 | The Newsletter of AIM • 1 HN point • 30 Jul 24

🕹 Technology AI Finance Innovation Automation Data

JPMorgan has introduced an AI chatbot named LLM Suite to assist its employees in idea generation and document summarization. This means that many tasks traditionally done by research analysts may now be handled by AI.
About 15% of JPMorgan's workforce in asset and wealth management will use this AI, showcasing the bank's large investment in artificial intelligence. It shows how serious the company is about improving efficiency with technology.
JPMorgan is not new to using AI, as they already have over 300 AI projects. This AI push is part of a broader trend in the finance sector to integrate advanced technology into everyday operations.

ChatGPT becomes a Platform[Finance Fridays]

Technology Made Simple • 19 implied HN points • 25 Mar 23

🕹 Technology AI Platforms Data Internet Programming

OpenAI has added new functionality to ChatGPT with plugins, turning it into a platform
This development is comparable to Apple launching the app store, opening up numerous opportunities for ChatGPT
While the new plugins provide advantages, they may not completely solve ChatGPT's fundamental issues

Standing on the brains of giants

Startup Strategies • 71 implied HN points • 05 May 23

🕹 Technology AI Artificial Intelligence ML Ethics Data

AI often draws on the intelligence of others rather than being truly artificial intelligence.
Machines are successful because they combine the thoughts and ideas of many people.
These AI systems can blur the lines between human and machine-generated ideas.

The Value of Stability in Scientific Funding, and Why We Need Better Data

The Good Science Project • 18 implied HN points • 17 Feb 24

🔬 Science Funding Data Research Policy Employment

Scientific funding instability negatively impacts researchers' ability to plan and conduct research effectively, leading to swings in funding and unnecessary time spent on grant proposals.
Improved data tracking is crucial to understanding the impact of funding gaps on researchers' employment outcomes, highlighting the need for long-term empirical studies in science policy.
Addressing funding stability issues and utilizing detailed longitudinal data can help prevent obstacles in scientific progress and support the longevity of researchers' careers.

Update #68: Whispering Indigenous Languages and Neural Net Training Dynamics

The Gradient • 27 implied HN points • 13 Feb 24

🕹 Technology AI Language Data Research Ethics

Papa Reo raised concerns about Whisper's ability to transcribe the Māori language, highlighting challenges faced by indigenous languages in technology.
Neural networks learn statistics of increasing complexity throughout training, with a focus on low-order moments first before higher-order correlations.
Including native speakers in language corpora and model evaluation processes can substantially improve the performance of natural language processing systems for languages like Māori.

The End of Software, Long Live Software

Chaos Engineering • 5 implied HN points • 04 Dec 24

🕹 Technology AI Software Data Automation Engineering

AI Agents are changing how we think about software. They are smart programs that can do tasks for us, but we still need humans to help out to make sure everything runs smoothly.
Using AI to create software can make things cheaper, but it also makes the software more complex. As we rely on AI, we need to ensure we can trust it to work reliably.
Data is super important for AI to work well. We need to collect good quality data to train these AI Agents so they can do their jobs effectively and produce accurate results.

Square footage is so broken and weird

Counting Stuff • 65 implied HN points • 21 Mar 23

💼 Business Real Estate Data Standardization Measurement Market Analysis

NYC housing market values vibes over square footage
Measuring square footage in real estate is inconsistent and prone to errors
Efforts to standardize real estate measurements are limited in scope and face challenges

AI: What Has Not Been Said

Dr. Pippa's Pen & Podcast • 29 implied HN points • 21 Sep 23

🕹 Technology AI Data Algorithm Metaverse

AI empowers individuals to create without relying on expensive coders.
AI is reshaping our interaction with reality through algorithmic processing.
AI is creating a new data-driven architecture that needs to be examined for soundness.

Clouded Judgement 10.18.24

Clouded Judgement • 7 implied HN points • 18 Oct 24

🕹 Technology Software AI Enterprise Data Cloud

Enterprise software has always relied on systems that store data, but the real value comes from how people use that data in workflows. It's not just about the data, but how it's managed and processed.
AI is set to change this by taking over the data entry tasks that humans typically do. This means less focus on user interfaces and more on how efficiently AI can handle and process data automatically.
With this shift to AI-driven systems, we will see new ways of building applications that prioritize smart databases. This could make traditional systems less important and create a need for new tools to manage complex workflows.

Cooling Servers Through Code

Sector 6 | The Newsletter of AIM • 19 implied HN points • 02 Jun 23

🕹 Technology AI Environment Computing Data Sustainability

Generative AI can have a big environmental impact. For example, GPT-3's energy use is compared to driving 123 cars for a year.
There is concern that generative AI may not just affect the environment but could also pose other risks in the future.
Researchers are exploring ways to cool servers more efficiently through coding techniques to reduce their environmental footprint.

Only 60% of Black boys in Michigan graduate high school on time

Of Boys and Men • 45 implied HN points • 18 Apr 23

🇺🇸 U.S. Politics Race Gender Education Policy Data

Only 60% of Black boys in Michigan graduate high school on time
Data on high school graduation rates is harder to come by for boys
There are wide variations in graduation rates and gaps based on race and gender

From Heuristics to Precision

Sunday Letters • 19 implied HN points • 02 Jul 23

🕹 Technology AI Networking Data Hiring

Networking is really important because personal connections help match jobs with the right people. Good networks can filter out the best candidates more easily than sifting through tons of data.
Large language models (LLMs) can help improve hiring by analyzing resumes with more depth and precision. This could lead to better and fairer hiring processes.
We are seeing a new kind of precision in handling data that will change how we think and work. While it can improve job fits, it might also raise concerns about privacy and control in other areas.

What is Sarah's Newsletter?

Sarah's Newsletter • 59 implied HN points • 25 Jan 22

🕹 Technology Data Tech

Sarah's Newsletter focuses on starting conversations about trends in data and tech, led by Sarah Krasnik Bedell, a data engineer and thought leader.
The newsletter covers topics like data engineering, analytics, and diversity in the tech industry beyond Silicon Valley.
Sarah started the newsletter to connect with smart individuals and discuss interesting ideas in the tech community.

Watermark the world

escape the algorithm • 59 implied HN points • 18 Mar 22

🕹 Technology Internet Ownership Privacy Ethics Data

Google Street View is made up of images from various sources, including everyday people, blended together to create a seamless representation of the world.
Watermarks added to Google Street View images are intentional, potentially highlighting the hidden labor behind the scans or symbolizing colonialism by claiming ownership of public spaces.
This raises a question: is watermarking in Google Street View a way to show presence, or a way of staking a claim on territory?

Five Links for April 2023

Five Links (and three graphs) by Auren Hoffman • 56 implied HN points • 07 Apr 23

🕹 Technology Podcasts Reading Books AI Data

Founder archetypes: insider vs outsider when starting a company
Fascinating fraud story about Wirecard in Germany
Podcasts and resources for learning about data, economics, and AI

It’s time to dilly, DALL-E

Sector 6 | The Newsletter of AIM • 59 implied HN points • 18 Apr 22

🕹 Technology AI Innovation Software Data Trends

Generative adversarial networks (GANs) are a type of AI used to create art, like 'Portrait of Edmond de Belamy.'
Ian Goodfellow is recognized as the 'father of GANs' and has influenced the technology's development.
The name 'Belamy' is a clever play on words, meaning 'good friend' in French, linking to Goodfellow's name.

Comments to the White House on Federal Data

The Good Science Project • 26 implied HN points • 13 Sep 23

🇺🇸 U.S. Politics Policy Data Funding Research

The White House should make federal data more accessible in a machine-readable format.
NIH and NSF need to be more open in allowing external researchers access to critical data on agency operations.
There is a need to update scientific funding agencies like NIH and NSF to enable external researchers to access data more easily.

How to create fake medical images[Technique Tuesdays]

Technology Made Simple • 19 implied HN points • 20 Dec 22

🕹 Technology AI Medical Data Machine Learning

Collecting high-quality medical data is hard due to expertise required for annotations.
Sharing medical data is restricted by regulations, presenting challenges for research.
Using AI-generated synthetic images can help overcome data quality and sharing issues in medical research.

Lobster long-read #1: A short history of unlocking things

Design Lobster • 119 implied HN points • 12 Nov 20

🕹 Technology Privacy Security Internet Data Digital

Locks have evolved over time, from simple mechanisms like holes in doors to more complex designs with pins and tumblers, highlighting the importance of privacy and security in history.
The mental model of a lock, where a key unlocks a 'private' space, is now applied to digital privacy, but the reality is that we entrust our digital possessions to third parties online.
An alternative paradigm for online privacy involves incorporating detection mechanisms, like Apple's iOS alerts, to make visible the handling of our digital data by third parties and promote transparency.

Data is Dead 💀

Sector 6 | The Newsletter of AIM • 19 implied HN points • 14 Apr 23

🕹 Technology AI Data Software Innovation Internet

Gathering a lot of data is not as valuable as it used to be. New tools are making it easier for competitors to catch up quickly.
Large Language Models (LLMs) are changing the game by allowing companies to use existing data to build similar or competitive products.
Companies should rethink their strategies about data hoarding, as just having a lot of data is no longer a strong advantage.

Google Gone Bard

Sector 6 | The Newsletter of AIM • 19 implied HN points • 09 Apr 23

🕹 Technology AI Software Innovation Business Data

Google is seen as a steady player in AI, while Microsoft is more aggressive, which could change the balance of power.
Google faces a challenge because its successful search business might clash with new AI technologies.
It’s important for Google to embrace generative AI to stay competitive without losing its existing business.

Lessons from Plaid for a Future Energy Unicorn

Equal Ventures • 39 implied HN points • 11 Mar 22

🕹 Technology Energy Data API

The energy sector is undergoing a digital transformation moving towards decentralized operations with renewables, envisioning a grid that resembles the internet.
Data infrastructure plays a crucial role in shaping the future of the energy industry, with a focus on API solutions specific to the energy sector.
Equal Ventures shared insights on the topic with Climate Tech VC, highlighting the importance of preparing for the evolving energy landscape and advocating for data-driven solutions.

Growth Is Up. So What's Up With the Economy?

Gideon's Substack • 19 implied HN points • 26 Oct 23

💼 Business Economy Inflation Perceptions Data Fiscal policy

The American economy is performing exceptionally well post-pandemic, surpassing other developed countries.
There is a notable disconnect between people's perceptions of the economy and the actual economic data, leading to various theories and concerns.
Factors such as the pandemic hangover, inflation, wage discrepancies, and fiscal uncertainties contribute to the complex economic landscape, influencing consumer sentiment and political outcomes.

So you’ve solved the chicken-and-egg problem…

Platform Papers • 19 implied HN points • 22 Dec 22

💼 Business Ecosystems Promotion Data Orchestration

Successful platform ecosystem orchestration involves more than just network effects
Selective promotion is a powerful tool for directing attention to high-quality complements in a platform ecosystem
Facilitating scale benefits among complementors can help drive greater value and prevent dominance in a platform ecosystem