The hottest Data Analysis Substack posts right now

And their main takeaways
Category
Top Technology Topics
Steve Kirsch's newsletter 10 implied HN points 19 Jan 25
  1. The Czech Republic has released detailed vaccine data for the first time, showing that the Moderna vaccine may be more dangerous than the Pfizer vaccine. This data is important for understanding vaccine safety.
  2. Analysis of this data suggests that the Moderna vaccine could increase all-cause mortality by about 50% compared to Pfizer, which raises serious concerns about its safety even outside of COVID periods.
  3. Despite this significant information available, it appears that many in the medical community are ignoring the findings, which highlights the need for more transparency in public health data.
CalculatedRisk Newsletter 14 implied HN points 31 Oct 24
  1. The Freddie Mac House Price Index went up by 3.6% compared to last year. This shows that house prices are on the rise.
  2. Many cities in Florida are struggling with real estate; 17 out of the 30 worst performing cities are located there.
  3. The Freddie Mac index is based on specific loans and includes sales data to track house prices accurately.
Ill-Defined Space 28 implied HN points 07 Mar 24
  1. The claim that China has 359 intelligence satellites may be inaccurate, as this number includes civil and military satellites, not just those intended for intelligence purposes.
  2. While China's spacecraft deployments have increased, they have not tripled, as suggested by a U.S. Space Command general.
  3. Despite concerns about China's space activities, the data indicates that U.S. military spacecraft deployments have not significantly increased, and the role of commercial spacecraft in the industry is substantial.
ASeq Newsletter 14 implied HN points 30 Oct 24
  1. Vendors sometimes quote theoretical maximums for data output, which can be misleading. It's important to understand that these numbers might not reflect actual performance.
  2. Comparing different technologies can be complicated because they have different specifications and capabilities. Each technology, like PacBio, Oxford Nanopore, and Illumina, has its unique strengths and limitations.
  3. In the real world, the difference between what is theoretically possible and what is actually achieved can be significant. This means we should be cautious and not rely solely on theoretical figures.
Get a weekly roundup of the best Substack posts, by hacker news affinity:
Klement on Investing 3 implied HN points 13 Aug 25
  1. Using high-quality data sources can really improve an analyst's accuracy. It’s better to have various data subscriptions than just relying on experience.
  2. Smaller, specialized data providers have a bigger positive impact on forecast accuracy than larger firms. But these smaller providers often get cut for cost-saving reasons.
  3. Cutting back on data access can harm investment research. It can lead to less unique insights, making it hard for analysts to stand out, which isn't good for business.
Web3 for Analytics Engineers 1 HN point 13 Jun 24
  1. Web3 is a decentralized internet on blockchain tech, aiming for user ownership and benefits for many people.
  2. Blockchain technology, at the core of Web3, offers immutability, decentralization, transparency, and cryptographic security.
  3. Web3 analytics introduces opportunities like decentralized data storage, on-chain data analysis, smart contract analytics, DeFi analytics, and NFT analytics.
thomaswdinsmore 1 HN point 12 Jun 24
  1. Dataiku is preparing for a potential exit, possibly an IPO, evidenced by recent investments and new executive hires.
  2. Dataiku focuses on business users with its analytics platform, leveraging partnerships with big data players like Databricks and Snowflake.
  3. While Dataiku shows growth in revenue, its capabilities in machine learning and generative AI, like Hugging Face models, are not as robust, and they partner with other companies for these advanced technologies.
Fish Food for Thought 11 implied HN points 11 Dec 24
  1. The DX Core 4 Framework helps companies measure developer productivity by looking at four main areas: Speed, Effectiveness, Quality, and Impact. This balanced approach provides a complete picture of how well teams are performing.
  2. It includes a Developer Experience Index (DXI) that shows how developers feel about their work, helping identify areas for improvement. This means companies can catch issues before they become bigger problems.
  3. The framework focuses on connecting developer productivity to business goals, making it easier for all levels of the organization to understand how engineering work impacts the company's success.
LatchBio 11 implied HN points 12 Dec 24
  1. Single cell sequencing helps scientists understand individual cells better. This technique is key for studying diseases and biological processes.
  2. Bench scientists need simple tools to analyze single cell data without needing extensive computational skills. This will help them work more independently and quickly.
  3. Providing scientists with easy access to their data will lead to new questions and insights in research. This can improve drug development and other important biological discoveries.
Steve Kirsch's newsletter 12 implied HN points 31 Oct 24
  1. There is no clear medical reason for COVID vaccines to prevent infection. Natural infections can create immunity, but not the kind from an injected vaccine.
  2. After vaccines were given out, the data showed that the rate of deaths actually increased and stayed the same for a year, even though it was going down before the vaccines.
  3. Some people in the medical field believe vaccines can cause harm, but are pressured not to publish their findings because of funding and institutional pressures.
Klement on Investing 4 implied HN points 20 Jun 25
  1. On average, women speak more words per day than men. Women use about 13,349 words while men use around 11,950 words daily.
  2. As people age, how much they talk can change. Younger men and women talk similarly, but older men often become more talkative than older women.
  3. Some people barely talk, while others can speak a ton, like 50,000 words a day. It's interesting to see such a big range in how much different people communicate.
Links I Would Gchat You If We Were Friends 59 implied HN points 25 Nov 20
  1. Google's star ratings for recipes in search results don't reflect the actual quality of the recipes, and sites can manipulate these ratings for more traffic.
  2. Recipe sites often rely on rich snippets and Google star ratings to attract clicks, leading to a lack of consistency and standardization in the ratings.
  3. Simply Recipes stood out with unusually high average ratings, raising suspicions about the authenticity of the ratings across popular food publications.
Conspirador Norteño 36 implied HN points 10 Sep 23
  1. Inauthentic social media accounts can change purposes, either by original operators or new owners.
  2. Cryptocurrency spam accounts can use plagiarized photos and often get suspended by platforms like Twitter.
  3. Some reborn inauthentic accounts push propaganda, like pro-Russia/anti-Ukraine messages, and may evade bans.
Matt’s Five Points 19 implied HN points 04 Nov 22
  1. You can run a quick election simulation by using an Excel sheet. Just change the win probabilities for each state and the sim does the math for you in about 2 seconds.
  2. Basic election modeling isn't as hard as it sounds. You can easily create your own model with some data and a few calculations to forecast election outcomes.
  3. Strong, accurate models take more work and understanding, but anyone can start trying their hand at it. It can be enjoyable to explore different scenarios with the data.
Steve Kirsch's newsletter 12 implied HN points 19 Oct 24
  1. Health authorities may avoid answering tough questions about vaccine effectiveness. It's important to push for clear and honest responses.
  2. Data from nursing homes suggests that COVID vaccinations did not significantly reduce deaths. This raises concerns about the actual impact of the vaccines.
  3. There are claims that more vaccinations could be linked to increased COVID infections. It's crucial to understand why vaccination rates and infection rates may not align as expected.
Steve Kirsch's newsletter 8 implied HN points 25 Jan 25
  1. The vaccines may have caused more COVID cases and deaths than they helped prevent. Data shows that vaccinated individuals had higher case rates during 2021 and 2022.
  2. Some studies suggest that vaccines may increase the risk of adverse health outcomes, like myocarditis and all-cause mortality, especially with certain brands.
  3. There is ongoing debate and skepticism surrounding vaccine safety, with some polls indicating that a significant number of people believe vaccines have contributed to deaths similar to COVID itself.
Recommender systems 26 implied HN points 20 Jan 24
  1. Reducing selection bias and popularity bias in ranking is important for recommender systems.
  2. An advocated approach is to factorize user interaction signals to account for biases originating from power users and power items.
  3. The proposals for causal/debiased ranking involve factorization, mutual information, and mixture of logits to improve the ranking model.
Natural Selections 12 implied HN points 22 Oct 24
  1. Climate science often relies on models that may not fully prove human actions are the main cause of temperature increases. It's important to question what we assume about these models.
  2. Some media outlets present conclusions about climate change as facts, which can mislead people. They may not explore other possible reasons for climate events.
  3. True science should consider multiple explanations for observations instead of insisting on a single cause. It's essential to keep an open mind in scientific discussions.
ASeq Newsletter 7 implied HN points 28 Feb 25
  1. Roche's Q39 accuracy system is different from other platforms like Illumina and Oxford Nanopore. It's important to compare them carefully as each has unique metrics.
  2. The average accuracy of different sequencing platforms varies, but Roche doesn't provide clear comparisons. They share limited data about their simplex accuracy.
  3. Understanding the differences in data quality and error rates across platforms is crucial. Factors like read length and error filtering play a significant role in the accuracy of sequencing results.
Dataplane.org Newsletter 19 implied HN points 04 May 22
  1. Outdated RPKI relying party clients can pose operational risks as software support ends. Monitoring software versions is crucial for security.
  2. Analysis revealed varying levels of outdated RPs among different client implementations. Routinator showed significant outdated usage.
  3. Dataplane.org is updating web pages, managing finances, and improving technical capacity, with a focus on tax preparation and back-end services.
Rounding the Earth Newsletter 8 implied HN points 22 Jan 25
  1. The concept of Healthy User Bias (HUB) suggests that healthy people are more likely to get vaccinated, which can skew vaccine effectiveness data.
  2. Recent COVID-19 data trends show a pattern where states are experiencing similar mortality rates, indicating a connection between health factors and vaccination rates.
  3. Deaths related to despair, like suicide and drug use, appear to be affecting mortality rates, especially in poorer areas, alongside any potential vaccine-related deaths.
Steve Kirsch's newsletter 11 implied HN points 15 Oct 24
  1. Confounders are factors that can distort data, making vaccines seem unsafe, but they should affect results randomly. It raises questions about why they only appear to show a negative impact on vaccines.
  2. There is a significant difference in mortality rates between different vaccine brands, suggesting there may be deeper issues like manufacturing defects or distribution biases impacting safety results.
  3. Despite individual observations of negative vaccine effects, people are often told to trust aggregated data from authorities, which can lead to doubts about the reliability of personal experiences and observations.
ASeq Newsletter 7 implied HN points 15 Jan 25
  1. PacBio is working on high-density chips that can hold more information than before. This means they can process data faster and more efficiently.
  2. The focus on ongoing technical development indicates that PacBio is trying to stay ahead in the biotech field. They are continuously improving their technology to meet market needs.
  3. The information presented is part of a broader update at the JPM conference, showing that PacBio is committed to advancing their technology and sharing their progress with subscribers.
Conspirador Norteño 20 implied HN points 09 Feb 24
  1. A network of taxi and real estate-themed social media accounts were used to boost political content on Twitter through automation.
  2. The botnet consisted of at least 98 Twitter accounts with automated posting schedules that operated 24/7.
  3. The botnet retweeted content based on hashtags, focusing on small accounts and political tweets rather than popular ones.
Apricitas Economics 26 implied HN points 03 Oct 23
  1. The US economy was actually larger than previously believed due to comprehensive revisions in GDP data.
  2. America's investment boom was stronger than initially reported, with notable upgrades in real fixed investment across sectors like housing and manufacturing.
  3. Revisions to US GDP data included improved methodologies, extensive data integration, and new data series to enhance the accuracy of measuring economic growth.
Wadds Inc. newsletter 59 implied HN points 17 Aug 20
  1. Innovation in the PR industry is strong, with many new agencies starting up during the pandemic. If you're considering freelancing or starting an agency, there are important tips to think about.
  2. Enero, a marketing services group, posted significant revenue growth recently. This shows that some companies are thriving even in challenging times.
  3. Many consumers now prefer to follow the news on TV rather than social media. This shift indicates changing habits in how people consume news.
Magis 27 implied HN points 04 Aug 23
  1. The post discusses decision making using external data.
  2. The author spoke at Domino Data Lab's Rev4 Conference about the hedge fund industry's use of external data.
  3. Rev conference is focused on strategic data leaders and offers great talks.
Data Science Weekly Newsletter 19 implied HN points 30 Jun 22
  1. Machine learning exercises can deepen your understanding of concepts like linear algebra and optimization. Practicing these can help you think critically about model building.
  2. Ethical AI development toolkits play a crucial role in shaping how companies approach ethics in technology. It's important to recognize the gaps between what these toolkits suggest and the real work involved in implementing ethical practices.
  3. Recent studies on adaptive optimizers show that models can go through phases of overfitting before suddenly generalizing very well. Understanding this 'grokking' phenomenon can help refine training processes for better performance.
Dataplane.org Newsletter 19 implied HN points 03 Jan 22
  1. Dataplane.org is actively involved in RPKI RP measurement work since May 2021, tracking synchronization data and software usage diversity in RPKI relying parties.
  2. A significant and unexplained drop in SSH activity globally was observed in early October 2021, particularly affecting users of 'libssh', possibly due to a new SSH worm infection.
  3. Dataplane.org introduced a new signal data named sshidpw, providing daily reports of SSH id/password pairs seen in authentication attempts, proving beneficial for system admins and researchers.