The hottest Data Analysis Substack posts right now

And their main takeaways

LLM-Generated Self-Explanations

Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 0 implied HN points • 21 Dec 23

🕹 Technology AI Research Machine Learning Natural Language Processing Data Analysis Software Engineering

LLMs can make predictions and explain how they arrived at those predictions. This helps in understanding their reasoning better.
Using a 'Chain of Thoughts' method can improve LLMs' ability to solve complex tasks, especially in areas like math and sentiment analysis.
There's a need for better ways to evaluate the explanations given by LLMs because current methods may not accurately determine which explanations are effective.

LLM Drift

Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 0 implied HN points • 29 Sep 23

🕹 Technology AI Machine Learning Natural Language Data Analysis Research

LLM Drift refers to big changes in how language models respond over a short time. This means their answers can differ quite a bit unexpectedly.
Studies show that the accuracy of models like GPT-3.5 and GPT-4 can go up and down significantly in just a few months. Sometimes they get worse at certain tasks.
It's important to keep checking how these models behave over time because their performance can shift for many reasons, not just from minor tweaks.

What Constitutes A Large Language Model Application?

Cobus Greyling on LLMs, NLU, NLP, chatbots & voicebots • 0 implied HN points • 30 Mar 23

🕹 Technology AI Software Machine Learning Applications Data Analysis

Large Language Models (LLMs) are advanced AI tools that can understand and create human language. They help with tasks like writing, summarizing, and recognizing different pieces of information.
There are different parts to building applications with LLMs. This includes using models, tools for development, and creating apps that end users can interact with.
Prompt engineering is important for getting the best results from LLMs. It involves creating and managing prompts to guide the AI in generating useful responses.

✅ Making sense of the UK market for corporate communications and public relations

Wadds Inc. newsletter • 0 implied HN points • 04 Mar 24

💼 Business Corporate Communications Public Relations Market research Employment Trends Data Analysis

A new project is starting to collect and share important job data in the UK public relations and corporate communications market. This will help people understand job trends and opportunities better.
Many people in PR change jobs every year, and there are lots of freelancers, making it a mixed and active job market. The project aims to help track these changes.
There will be a monthly newsletter featuring job openings in PR, and the project will gradually expand to include more data sources over time.

✅ Monday briefing: Pandemic whiplash for public relations

Wadds Inc. newsletter • 0 implied HN points • 17 Jul 23

💼 Business Public Relations Data Analysis Crisis Management Industry Trends Social media

The public relations industry has seen a significant drop in investment, losing over $1 billion in the past year.
Employee-employer relationships in PR have changed, with many hiring freezes and layoffs instead of raises.
The connection between PR roles and company leadership is weakening, with fewer executives reporting directly to top management.

Get a weekly roundup of the best Substack posts, by hacker news affinity:

✅ Monday briefing: Data journalism, agency incubator, oil greenwashing, UK productivity bounce, influencer agencies, social media benchmarks, Reddit insight, and more...

Wadds Inc. newsletter • 0 implied HN points • 21 Feb 22

💼 Business Marketing Media Public Relations Data Analysis Social media

Data journalism is growing, helping people understand local issues like air quality through interactive maps. This shows how media can use data to inform the public.
The influencer marketing industry is rapidly evolving, with many new specialized agencies emerging. This trend highlights how brands are adapting to better engage audiences.
Social media is losing its positive impact on politics, thanks to misinformation and echo chambers. This situation suggests that we need to rethink how we use these platforms for democracy.

✅ Monday Briefing: Lockdown creative, PR reporting, Kickstart scheme, recruitment recovery, flawed ad metric, Clubhouse primer, Lockdown Unconference, and more…

Wadds Inc. newsletter • 0 implied HN points • 01 Feb 21

💼 Business Marketing Public Relations Data Analysis Social media Recruitment

Lockdown creative processes have evolved to include remote brainstorming and technology use, allowing teams to connect and collaborate effectively despite physical distance.
The recruitment landscape is recovering, particularly in digital marketing and PR, but some areas still face challenges as the job market adjusts after lockdowns.
Social media platforms like Clubhouse and Facebook are adapting to new practices, with insights on engagement and content formats that cater to different audiences and enhance user experience.

✅ Media Briefing #25: Notes by Nafisa, comms vaccination guide, my new book, linking out, words matter, coverage metrics, and more…

Wadds Inc. newsletter • 0 implied HN points • 21 Dec 20

💼 Business Media Communications Public Relations Data Analysis Technology Tools

It's important to recognize and address privilege and diversity in public relations. Sharing personal stories can help highlight these issues.
Choosing the right words in communication is crucial. The way we express ourselves can have a big impact, so it's good to be mindful of language.
When sharing news or articles, linking back to original sources is essential. It not only gives credit but also adds credibility to the information shared.

Data Update 1 for 2021: A (Data) Look Back at a Most Forgettable Year (2020)!

Musings on Markets • 0 implied HN points • 09 Jan 21

💰 Finance Data Analysis Market Trends Investment Strategies Corporate Finance Economic Impact

Data is most valuable when it's unique and exclusive. If everyone has access to the same data, it loses its worth.
It's important to look at the big picture with data to avoid tunnel vision. By understanding industry norms, investors can better judge individual stocks.
Data can expose misinformation and challenge common beliefs. Relying on facts rather than opinions helps clarify the truth in financial discussions.

Data Update 2 for 2020: Retrospective on a Disruptive Decade

Musings on Markets • 0 implied HN points • 27 Jan 20

💰 Finance Investing Markets Economic Trends Risk Assessment Data Analysis

The past decade saw strong growth in stocks, with the S&P 500 nearly tripling in value and a notable rise in bond returns as well. It was a great time for investors, especially those who held onto their portfolios.
Interest rates dropped significantly during this period, influenced by both global economic conditions and central bank actions. Many believe these low rates are here to stay as the economy's fundamentals support them.
Tech companies, particularly the FAANG group, led the stock market's rise, drastically increasing their market capitalization. This shift shows how important tech has become compared to traditional industries like energy.

Data Update 1 for 2020: Setting the table

Musings on Markets • 0 implied HN points • 13 Jan 20

💰 Finance Data Analysis Corporate Finance Risk Assessment Investment Strategies

Accessing raw data for companies is easy now, but choosing the right data sources and how to analyze it is important. It's like picking the best ingredients for a recipe.
Using different types of data, like macro and micro data, helps provide a clearer picture of a company's financial health. Each type of data tells a part of the company's story.
Data can be biased and misused, so it's important to look beyond just numbers. Making decisions based on data should include critical thinking and understanding the context.

January 2019 Data Update 8: Dividends and Buybacks - Fact and Fiction

Musings on Markets • 0 implied HN points • 08 Feb 19

💰 Finance Investing Stock Market Economic Policy Corporate Finance Data Analysis

Companies are spending a lot more on stock buybacks compared to dividends. This trend has been growing since the 1980s, with more than 60% of cash returned to shareholders coming from buybacks in recent years.
There's a debate about whether buybacks are good for the economy. Some say they help shareholders while others believe the money should be reinvested in businesses or used to increase wages for workers.
Not all companies use buybacks in the same way. Larger, mature companies tend to buy back more stocks, but many smaller or high-growth companies are still focused on building their businesses instead.

Damodaran Online: There is an App for that!

Musings on Markets • 0 implied HN points • 05 Mar 18

💼 Business Finance Education Technology Data Analysis Corporate strategy

The app named 'Damodaran Online' gathers all materials from his website, blog, and YouTube into one place for easy access on Apple devices.
He is currently on sabbatical, enjoying a break from regular teaching but continuing to share knowledge through various classes and external workshops.
His research and writing projects include updating his book on valuing tough companies and exploring the difference between pricing and valuing assets.

January 2018 Data Update 7: Growth and Value

Musings on Markets • 0 implied HN points • 27 Jan 18

💰 Finance Investing Corporate Finance Market Analysis Data Analysis Performance Metrics

Profitability is measured using various profit margins, which help assess how well a company is doing. It’s important to choose the right measure based on what you're analyzing, like gross margin for efficiency or net margin for overall profitability.
Excess returns show how much a company earns above its cost of capital, and most companies struggle to achieve this. Many firms aren't making enough money to cover their investments, highlighting a risk in company performance.
Regional, sector, and size factors influence company profits. For instance, smaller companies often perform worse than larger ones, and certain industries, like technology, can produce high returns while others, like retail, may struggle.

January 2018 Data Update 2: The Buoyancy of US Equities

Musings on Markets • 0 implied HN points • 09 Jan 18

💰 Finance Investing Stocks Bonds Data Analysis Economic Trends

US stocks had a strong performance in 2017, achieving a 21.65% return, which surprised many experts. This shows that the equity market can thrive even with various economic and political concerns.
Despite a good year for stocks, the fundamentals improved, with earnings and dividends rising. This suggests that the stock prices are supported by healthier financials.
Looking ahead, there's potential for Treasury bond rates to rise, which could impact equity performance. Investors need to watch changes in tax laws and overall economic conditions as these factors may influence the market.

January 2017 Data Update 1: The Promise and Perils of "Big Data"!

Musings on Markets • 0 implied HN points • 09 Jan 17

💼 Business Data Analysis Investment Finance Corporate strategy Market Trends

Numbers can seem super precise, but they often aren't. How we calculate them can really change the results, so we should always be careful with our interpretations.
Data isn't always objective; it can carry biases just like stories do. It’s important to look at different ways a number can be presented to get a clearer picture.
Just having data doesn't mean it will lead to profits. For data to be valuable, it needs to be exclusive or actionable, which isn't always the case.

The Tax Story in 2015: Myths, Misconceptions and Reality Checks

Musings on Markets • 0 implied HN points • 19 Jan 15

💰 Finance Taxation Corporate Finance Economic Policy Data Analysis

Many people think they pay their fair share of taxes while believing that others don't. It helps to look at real data to see how taxes are actually paid.
Even though the U.S. has a high corporate tax rate, companies in the U.S. pay a significant portion of their income in taxes, similar to or higher than companies in other countries.
There's talk of changing the corporate tax code in the U.S. to make it simpler and fairer. Suggestions include lowering the tax rate and only taxing foreign income at local rates.

Numbers Time! Data update for 2014

Musings on Markets • 0 implied HN points • 09 Jan 14

💰 Finance Data Analysis Investment Strategies Corporate Finance Global Markets Valuation Techniques

Data access has changed a lot over the years. In the past, it was hard to find data unless you were at a university or bank, but now it's way easier and more global.
The reason for sharing this data is partly self-interest. It helps the creator make better investment decisions and save time throughout the year.
When using this data, remember that it reflects personal judgments and can include errors. It's important to verify details and be cautious when making decisions based on the numbers.

Data Update 2013: The Dark Side of Numbers

Musings on Markets • 0 implied HN points • 13 Jan 13

💰 Finance Data Analysis Corporate Finance Investment Strategies Risk Assessment Economic Indicators

Some people use complex numbers to scare others into agreeing with them. You can fight this by sticking to common sense and focusing on the main idea.
Data can be twisted to support a certain viewpoint by only showing what fits. Always check for the full picture before believing claims.
Many analysts hide behind data instead of making tough decisions. It's better to personalize and adapt data to your own understanding rather than rely on generic numbers.

Moneyball and Investing: Data, Information and my 2012 Update

Musings on Markets • 0 implied HN points • 26 Jan 12

💰 Finance Investing Data Analysis Market Trends Financial forecasting Corporate Finance

Investing should focus more on data and numbers rather than just gut feelings or stories from analysts. Just like in baseball, using hard data can lead to better investment choices.
Data is useful, but it’s important to understand that all numbers are estimates. This means they can have errors and should be used carefully.
To make good investment decisions, combine data analysis with sensible stories. Numbers are a starting point, but having a narrative helps make better choices.

Equity Risk Premiums and the Fear of Catastrophe

Musings on Markets • 0 implied HN points • 09 Mar 10

💰 Finance Investments Risk management Market Trends Economic Theory Data Analysis

The equity risk premium is what investors expect to earn above a safe rate like treasury bonds for taking on the risk of stocks. It helps explain stock market behavior over time.
Using historical data for equity risk premiums can be misleading because it looks back rather than forward. A better method is to calculate an implied premium based on current stock prices and expected future cash flows.
Fear of economic disasters strongly affects equity risk premiums. During crises, fear increases and affects investors' expectations, leading to quick shifts in the premium values.

Data Update for 2010

Musings on Markets • 0 implied HN points • 08 Jan 10

💰 Finance Data Analysis Market research Investing Market Analysis Data Analysis Risk Assessment Risk Assessment Global Markets

The author updates datasets for companies from different regions each year, focusing on risk, profitability, and debt measures.
This year's updates include new data for Indian and Chinese companies, expanding the coverage of the datasets.
Future blog posts will discuss what these updates reveal about global companies and markets.

Skip the joins with Semantic ABI

HyperArc • 0 implied HN points • 26 Jun 24

🕹 Technology Blockchain Web3 Data Analysis Open Source Smart Contracts

Semantic ABI helps organize data from Ethereum transactions better. Instead of dealing with lots of confusing tables, it allows you to get a clear view of the data directly.
By using Semantic ABI, you can easily combine data from different sources without complex joins. This saves time and makes analysis simpler.
The library supports features like adding extra meaning to data and finding matches in transactions more efficiently. It's designed to help with analyzing Web3 data easily.

Coming soon: Unmoderated Insights

Unmoderated Insights • 0 implied HN points • 26 May 23

🕹 Technology Social media Data Analysis Human behavior Digital Trends Online Communication

The blog focuses on breaking down complex topics into simple explanations. It's meant for people who like understanding things without the confusion.
It emphasizes the importance of data over beliefs, especially regarding social technologies and their impact on our lives.
The author invites readers to subscribe and share the blog with others who might enjoy it or benefit from it.

Product Hunt aftermath 🔥

André Casal's Substack • 0 implied HN points • 28 Aug 24

💼 Business Entrepreneurship Marketing Product Development Data Analysis Customer Engagement

Engaging with the right audience is key. It's important to connect with active Product Hunt users before launching to increase votes.
Collecting emails can help build interest. Adding a newsletter signup on the landing page could capture potential buyers' information.
Learning from each experience is vital. Reflecting on what can be improved helps for better results in future launches.

[in case you missed it] Data Science Weekly - Issue 464

Data Science Weekly Newsletter • 0 implied HN points • 16 Oct 22

🕹 Technology Data science Artificial Intelligence Machine Learning Data Analysis Software Development

Building a community of R users can greatly enhance collaboration and knowledge sharing, especially in specialized fields like pharmaceuticals.
Generating research ideas often starts with identifying gaps in existing literature, which can be guided by specific frameworks to improve the quality of ideas.
Data cleaning is crucial for model accuracy, and its success relies on effective ETL processes and organizational commitment to maintaining high-quality data.

[in case you missed it] Data Science Weekly - Issue 463

Data Science Weekly Newsletter • 0 implied HN points • 09 Oct 22

🕹 Technology Data science Machine Learning Artificial Intelligence Software Development Data Analysis

To explore a large CSV file, you should use handy tools and methods to quickly understand the data without getting overwhelmed.
AI can help convert messy unstructured text into organized data, speeding up tasks that would usually take a long time manually.
Building a career in data science involves learning not just the technical skills but also how to navigate job opportunities and project management.

[in case you missed it] Data Science Weekly - Issue 398

Data Science Weekly Newsletter • 0 implied HN points • 11 Jul 21

🕹 Technology Data science Machine Learning AI Programming Data Analysis

Data science projects can analyze unique datasets, like personal music streaming from Apple Music, helping us understand our listening habits better.
Language affects how cultures understand color, with some languages having fewer words for colors, which is interesting for studying cultural differences.
Using advanced techniques like causal inference can help businesses make better pricing decisions, improving their competitiveness in the market.

[in case you missed it] Data Science Weekly - Issue 390

Data Science Weekly Newsletter • 0 implied HN points • 16 May 21

🕹 Technology Data science Artificial Intelligence Machine Learning Software Development Big Data Data Analysis

AI can solve complex puzzles better than humans, but humans still have unique skills. Don't give up on challenging word games just yet!
Defining trees in biology is tricky because many plants don't fit into clear categories. It's surprising how many things that look like trees actually aren't.
New technology makes searching through large image databases easier. With smart algorithms, you can quickly find the pictures you're looking for without remembering file names.

[in case you missed it] Data Science Weekly - Issue 366

Data Science Weekly Newsletter • 0 implied HN points • 29 Nov 20

🕹 Technology Data science Machine Learning AI Software Engineering Data Analysis

Pinterest improved its data infrastructure by moving from Lambda to Kappa architecture to better handle its visual signals for machine learning. This change aimed to streamline costs and enhance signal availability.
When building machine learning models, companies like DoorDash face huge data challenges. Choosing the right feature store is crucial for managing this data effectively, ensuring performance without overspending.
Differentially private learning still faces challenges in performance compared to traditional models. For effective results, more private data or improved features from public data may be necessary.

S2 E1: Solving Self-Serve Analytics

CAUSL Effect • 0 implied HN points • 02 Oct 23

🕹 Technology Data Analysis User Experience Software Development

Self-serve analytics lets non-analysts access and analyze data without always needing help from an analytics team. This can help speed up decision-making and reduce bottlenecks.
The goal is to create tools and provide education for everyday users so they can do their own analytics easily. Training and tutorials will be essential to help users become comfortable with these tools.
The focus is on keeping users engaged and motivated to use self-serve analytics. Understanding what stops people from doing analytics themselves is key to improving the program.

Writing Useful Performance Reviews: Evaluating the data and writing the review

It Depends / Nimble Autonomy • 0 implied HN points • 19 May 24

💼 Business Management Performance Leadership Human Resources Data Analysis

Collect all relevant data before writing a performance review. This includes past reviews, feedback, and notes so you have a complete view of the person's performance.
Be clear and honest when writing the review. Avoid vague language or trying to balance out negatives with positives; it’s important for the person to understand their true performance.
After writing the reviews, check for patterns or biases. Make sure each review makes sense and supports your conclusions about each person's performance.

Measurement noise: when 1000*1000 isn't a million

filterwizard • 0 implied HN points • 03 Oct 24

🕹 Technology Electronics Engineering Measurement Data Analysis Signal Processing

Measurement noise can make it seem like you need very high accuracy to get correct results, but you might actually need less than you think.
For measuring small signals accurately, the required dynamic range isn't as extreme as multiplying the signal by itself; practical calculations can simplify this.
For specific accuracy requirements in noisy environments, using embedded microcontroller ADCs can be a good solution to achieve realistic signal-to-noise ratios.

How linear-phase filters can still cause phase distortion

filterwizard • 0 implied HN points • 14 Sep 24

🕹 Technology Signal Processing Audio engineering Data Analysis Filters

Even though linear-phase filters are supposed to keep the phase of signals the same, they can still cause unexpected phase changes. This can happen especially at stopband frequencies where the phase might jump abruptly.
Using simple filters, like box-car filters, can lead to problems because they may not completely block unwanted frequencies. This can result in the output signal being inverted or misinterpreted, especially when analyzing important data trends.
It's important to choose the right filter. Either use filters that effectively block unwanted frequencies or ones that don’t cause abrupt phase changes, to avoid messing up the signals you are trying to interpret.

Prediction and negative-delay filters: five things you should know

filterwizard • 0 implied HN points • 19 Aug 24

🕹 Technology Filters Signal Processing Data Analysis

Filters can delay signals as they take time to process inputs and produce outputs. It's important to understand this delay, especially when working with different types of signals.
While you can't completely eliminate delay in filters, you can create compensating filters to achieve zero or even negative group delay at certain frequencies. This can improve the accuracy of your system responses.
Negative-delay filters can actually predict future values of a signal based on its current ramping behavior. This can be really useful in control systems and financial data analysis.

Level Up Your RevOps Career

beyondrevenueoperations • 0 implied HN points • 19 Oct 24

💼 Business Operations Strategy Leadership Data Analysis Growth

RevOps is key to business success, bringing sales, marketing, and customer success teams together to grow revenue. Choosing the right career path in RevOps can greatly influence your impact.
There are two main paths in RevOps: the technical path, which focuses on data analysis and tools, and the strategic path, which emphasizes revenue strategy and leadership. Each path offers unique opportunities and challenges.
Combining technical and strategic skills can create a powerful professional. This 'T-shaped' skillset helps you make better decisions and improve business outcomes.

HN blogs - 16/10/24

HackerNews blogs newsletter • 0 implied HN points • 16 Oct 24

🕹 Technology Software Development Cybersecurity Data Analysis Tech Leadership Web Development

Using Strace can help you track specific system calls instead of every single one, making it easier to debug problems.
Technical leaders should be aware of common decision-making mistakes that can affect their teams and projects.
Understanding the right way to use string parameters in coding can improve your programming practices and avoid confusion.

Mastering DataFrame Joins in Spark: A Comprehensive Guide with Examples

DataSketch’s Substack • 0 implied HN points • 23 Jul 24

🕹 Technology Data science Software Engineering Big Data Data Analysis

DataFrames in Spark are like tables for big data. They help people work with large datasets efficiently across different computers.
There are several types of joins in Spark, such as inner, left, right, and full outer joins. Each type has a specific way of combining data from two DataFrames.
Setting up Spark is easy. You can install it, write a few lines of code to create DataFrames, and start joining data for analysis.

Choosing the Right SQL Technique to Transform Your Data Analysis

DataSketch’s Substack • 0 implied HN points • 24 Jun 24

🕹 Technology Data science Database Management Data Analysis Performance optimization

CTEs help make complex queries easier to read and are good for breaking down hierarchical data. But be careful not to use them too much, as they can slow things down.
Subqueries are useful for filtering and aggregating data, but they can be hard to read and slow if used in a complicated way. They work best for specific tasks in a query.
Temporary views are great for creating reusable logic that only lasts for the session. However, they can't be used outside of that session, so plan accordingly.

Speed to Search Success: Synonyms

Talking to Computers: The Email • 0 implied HN points • 14 Jun 24

🕹 Technology Data Analysis User Behavior Information Retrieval Algorithm Design

Using synonyms in search helps users find what they need faster. It allows them to use their own words instead of worrying about exact terms.
Creating synonyms can be tricky, but observing how users search can help build a better list. Watching what terms people actually use is more effective than guessing.
While synonyms cover many cases, they struggle with specific long terms. For more complex searches, vector search technology might be a better solution.