The hottest Data Analysis Substack posts right now

And their main takeaways
Category
Top Technology Topics
CAUSL Effect 19 implied HN points 25 Apr 23
  1. The author is shifting focus from company updates to more engaging discussions that inspire thought and community interaction. They believe it's important to write about topics that spark conversations rather than just update on business progress.
  2. They define a lead as an actual conversation about their services, not just messages without responses. They're monitoring their lead data closely and have gained 15 leads so far, which they consider a decent start after a few months.
  3. Managing leads can feel stressful, especially when unsure if the opportunity will come through. The author prefers clear 'closed' leads over 'open' ones, as the uncertainty in 'open' leads can be more anxious than outright rejection.
inexactscience 19 implied HN points 02 Mar 23
  1. Academia and business both use data to solve problems, but they focus on different aspects. In academia, getting the right answer is more important than how fast you get it.
  2. The speed-quality frontier shows that in academia, quality matters a lot, which means projects can take years. In business, speed is key, so decisions often get made quickly.
  3. Feedback loops are faster in business. Companies test ideas against real market data quickly, while in academia, feedback often comes later from peer reviews, slowing down the process.
The Product Channel By Sid Saladi 23 implied HN points 23 Jul 23
  1. ChatGPT plugins enhance product development with automation and specialized research capabilities.
  2. Installing ChatGPT plugins involves upgrading to ChatGPT Plus and enabling the Plugins Beta feature.
  3. Top 20 ChatGPT plugins offer diverse functionalities like creating diagrams, conducting data analysis, and providing personalized recommendations.
Dataplane.org Newsletter 19 implied HN points 03 Oct 22
  1. Dataplane.org has over 300 sensors in operation across 6 continents, providing valuable data from a wide range of networks.
  2. Unexpected anomalies like DNS query spikes can provide insight into network behavior and the importance of understanding data context.
  3. Dataplane.org plans to rebuild their RPKI setup due to ongoing issues caused by a previous experiment, aiming for simpler, more reliable monitoring in the future.
Steve Kirsch's newsletter 4 implied HN points 24 Oct 24
  1. A graph shows that vaccinated people are much less likely to die from COVID compared to those who are unvaccinated. This sounds convincing to get vaccinated.
  2. However, the graph might be misleading and doesn't tell the full story behind the numbers.
  3. The author offers more insights about why the graph is deceptive and argues against getting vaccinated.
Get a weekly roundup of the best Substack posts, by hacker news affinity:
Jacobo’s Substack 1 HN point 23 Jun 24
  1. The dataset shared focuses on PSG ticket price evolution for the 2023 - 2024 season, collected through scraping the Ticketplace marketplace.
  2. The data format is simple, featuring columns for timestamp, fixture, category, quantity, and price, providing a basis for analyzing ticket pricing trends and making predictions.
  3. The release of this dataset is aimed at facilitating student projects and filling the gap for attractive, open-source datasets for data analysis.
Fileforma Research 1 HN point 22 Jun 24
  1. Neuralink's Compression Challenge requires beating ZIP at compressing audio files, revealing unexpected complexities in brain data compression
  2. Claude Shannon's use of logarithms in measuring entropy lacks proof, highlighting the need for alternative entropy measures like the uniformity measure
  3. The proposed uniformity measure offers a way to calculate a sample's proximity to a uniform distribution, providing a new method for entropy measurement
Brick by Brick 9 implied HN points 07 Feb 24
  1. Microsoft reported significant growth with GitHub CoPilot, reflecting high adoption and productivity among developers
  2. An experiment showed developers using CoPilot completed tasks 55.8% faster, raising questions about generalizability
  3. Assessing the true impact of CoPilot on productivity requires rigorous experiments tailored to individual engineering organizations
Steve Kirsch's newsletter 15 implied HN points 14 Jan 24
  1. New Medicare data suggests that COVID vaccines may have increased mortality rates, contradicting promises of safety and efficacy.
  2. Unvaccinated individuals appeared to fare better in terms of mortality since April 2022, challenging the need for booster shots after that time.
  3. Flu vaccines also show concerning mortality rates, suggesting unsafe practices and lack of benefit.
All-Source Intelligence Fusion 20 implied HN points 20 Jun 23
  1. Babel Street announced the launch of its "Insights GPT" large language model.
  2. Babel Street aims to transition from a cellphone location-tracking firm to an artificial intelligence company.
  3. The Insights GPT platform may have significant government surveillance use cases, such as summarizing data on the Chinese Communist Party.
Web3 for Analytics Engineers 1 HN point 13 Jun 24
  1. Web3 is a decentralized internet on blockchain tech, aiming for user ownership and benefits for many people.
  2. Blockchain technology, at the core of Web3, offers immutability, decentralization, transparency, and cryptographic security.
  3. Web3 analytics introduces opportunities like decentralized data storage, on-chain data analysis, smart contract analytics, DeFi analytics, and NFT analytics.
thomaswdinsmore 1 HN point 12 Jun 24
  1. Dataiku is preparing for a potential exit, possibly an IPO, evidenced by recent investments and new executive hires.
  2. Dataiku focuses on business users with its analytics platform, leveraging partnerships with big data players like Databricks and Snowflake.
  3. While Dataiku shows growth in revenue, its capabilities in machine learning and generative AI, like Hugging Face models, are not as robust, and they partner with other companies for these advanced technologies.
Links I Would Gchat You If We Were Friends 59 implied HN points 25 Nov 20
  1. Google's star ratings for recipes in search results don't reflect the actual quality of the recipes, and sites can manipulate these ratings for more traffic.
  2. Recipe sites often rely on rich snippets and Google star ratings to attract clicks, leading to a lack of consistency and standardization in the ratings.
  3. Simply Recipes stood out with unusually high average ratings, raising suspicions about the authenticity of the ratings across popular food publications.
Matt’s Five Points 19 implied HN points 04 Nov 22
  1. You can run a quick election simulation by using an Excel sheet. Just change the win probabilities for each state and the sim does the math for you in about 2 seconds.
  2. Basic election modeling isn't as hard as it sounds. You can easily create your own model with some data and a few calculations to forecast election outcomes.
  3. Strong, accurate models take more work and understanding, but anyone can start trying their hand at it. It can be enjoyable to explore different scenarios with the data.
Counting Stuff 21 implied HN points 30 Mar 23
  1. Single panes of glass in technology often promise magic but fail to deliver in a meaningful way
  2. The concept of 'single panes of glass' in tech is fundamentally flawed because it doesn't mirror the efficiency and specialization seen in physical interfaces like those in transportation
  3. Project requests for 'single panes of glass' tend to lead to complex, unsustainable solutions that are difficult to manage and maintain over time
Steve Kirsch's newsletter 11 implied HN points 25 Jan 24
  1. Author offered to redact any records revealing private health information to challenge Health New Zealand
  2. Epidemiologists might have to testify about vaccine safety and efficacy in a New Zealand court
  3. This opportunity could challenge the safe and effective narrative about vaccines and help exonerate Barry
Steve Kirsch's newsletter 10 implied HN points 19 Feb 24
  1. The New Zealand OIA request revealed that COVID vaccines were found to increase the risk of dying, instead of providing protection against COVID.
  2. The data released under OIA showed that vaccinated individuals experienced a significant increase in mortality during the COVID outbreak, contrary to what was expected.
  3. Mainstream epidemiologists have avoided analyzing the data that shows the vaccines increased the risk of dying from COVID, leading to a lack of public discussion and questioning.
Dataplane.org Newsletter 19 implied HN points 04 May 22
  1. Outdated RPKI relying party clients can pose operational risks as software support ends. Monitoring software versions is crucial for security.
  2. Analysis revealed varying levels of outdated RPs among different client implementations. Routinator showed significant outdated usage.
  3. Dataplane.org is updating web pages, managing finances, and improving technical capacity, with a focus on tax preparation and back-end services.
Center for the Study of Partisanship and Ideology 11 implied HN points 29 Aug 23
  1. The winners of the Salem/CSPI Prediction Tournament were announced, including a $25,000 prize and a fellowship
  2. The analysis of the betting markets showed mixed results, with some events being accurately predicted while others were not
  3. Participants in the tournament were mostly young, male, and had a libertarian-leaning political orientation
Steve Kirsch's newsletter 8 implied HN points 17 Mar 24
  1. The reduction in MIS-C cases can be attributed to the virus, not the COVID vaccine. The virus shift to BA.2 variants coincided with the drop in cases.
  2. The data indicates that the protective effect of the vaccine did not suddenly grow stronger after a year. Immunity actually started to rapidly increase over time.
  3. Credit should be given to the virus for the drop in MIS-C cases, not the vaccine. The CDC did not recognize this relationship.
Wadds Inc. newsletter 59 implied HN points 17 Aug 20
  1. Innovation in the PR industry is strong, with many new agencies starting up during the pandemic. If you're considering freelancing or starting an agency, there are important tips to think about.
  2. Enero, a marketing services group, posted significant revenue growth recently. This shows that some companies are thriving even in challenging times.
  3. Many consumers now prefer to follow the news on TV rather than social media. This shift indicates changing habits in how people consume news.
ppdispatch 2 implied HN points 01 Nov 24
  1. Chain-of-thought prompting might actually make some tasks harder for AI, especially in visual tasks where less thinking works better.
  2. The DAWN framework allows AI agents to work together globally in a secure way, which can lead to improved collaboration.
  3. New mesomorphic networks are great for understanding tabular data and give clearer explanations, making them useful for various applications.
Data Science Weekly Newsletter 19 implied HN points 30 Jun 22
  1. Machine learning exercises can deepen your understanding of concepts like linear algebra and optimization. Practicing these can help you think critically about model building.
  2. Ethical AI development toolkits play a crucial role in shaping how companies approach ethics in technology. It's important to recognize the gaps between what these toolkits suggest and the real work involved in implementing ethical practices.
  3. Recent studies on adaptive optimizers show that models can go through phases of overfitting before suddenly generalizing very well. Understanding this 'grokking' phenomenon can help refine training processes for better performance.
Dataplane.org Newsletter 19 implied HN points 03 Jan 22
  1. Dataplane.org is actively involved in RPKI RP measurement work since May 2021, tracking synchronization data and software usage diversity in RPKI relying parties.
  2. A significant and unexplained drop in SSH activity globally was observed in early October 2021, particularly affecting users of 'libssh', possibly due to a new SSH worm infection.
  3. Dataplane.org introduced a new signal data named sshidpw, providing daily reports of SSH id/password pairs seen in authentication attempts, proving beneficial for system admins and researchers.
Dan's Stack 2 HN points 12 Feb 24
  1. Speed dating events are optimal with around 20 attendees (10 men, 10 women), maintaining a balanced gender ratio is important for a successful event.
  2. Ticket prices for speed dating events vary based on demand and gender ratio, with average prices around $25.
  3. Marketing strategies for speed dating events focus on gender-specific ad campaigns to ensure equal attendance of men and women, with Instagram and Facebook ads being the most effective channels.
Venture Reflections 12 implied HN points 17 May 23
  1. Successful pre-seed companies spend more money per month than unsuccessful ones.
  2. The difference in average monthly burn rates between successful and unsuccessful companies is small since 2016.
  3. Spending more money is likely an effect, not the cause, of success in finding product-market fit.
Dataplane.org Newsletter 19 implied HN points 29 Nov 21
  1. Dataplane.org, a platform for providing data feeds on internet activity, has gained recognition in the security community for its reliability.
  2. Dataplane.org is evolving from a personal project to a more formal organization with potential revenue streams to support growth.
  3. Future plans for Dataplane.org include website redesign, creating a search API, and expanding the types of data covered.
nick’s datastack 1 HN point 24 Apr 24
  1. Generative AI can generate data, impacting workflows and pipelines significantly.
  2. Using LLMs for prompt-based feature engineering can save time and effort compared to traditional methods like manual data searching and merging.
  3. While LLMs in data pipelines may feel magical, it's important to be cautious of potential inaccuracies due to the probabilistic nature of AI outputs.
Dominic Cummings substack 12 implied HN points 17 Apr 23
  1. New data shows Trump beating Biden and AOC & Kamala more easily
  2. Research conducted for the 2024 Presidential campaign focused on crucial issues like cost of living, health, and crime
  3. Efforts were made to ensure an accurate sample of low-education, low-trust voters to avoid polling errors