davidj.substack

davidj.substack focuses on the exploration and application of data strategies, tools, and practices for organizational impact. It covers the evolution of the Modern Data Stack, the importance of semantic layers, data team dynamics, and tooling efficiencies to enhance data handling, analytics, and management.

Data Strategy and Analysis Data Team Management Modern Data Stack Technologies Semantic Layers in Data Analytics Data Quality and Governance Tooling and Automation in Data Management Productivity and Efficiency in Data Operations Real-time Data Processing

The hottest Substack posts of davidj.substack

And their main takeaways
83 implied HN points 05 Apr 23
  1. Semantic layers are crucial for governance, security, accessibility, and developer experience benefits in data analytics.
  2. Standalone semantic layers offer more flexibility and serve multiple use cases compared to semantic layers built into BI tools.
  3. Different standalone semantic layer options like Cube, AtScale, dbt/MetricFlow, and Looker Modeller provide unique features and cater to varying needs in data modeling and analytics.
23 implied HN points 29 Feb 24
  1. Consider how to use a semantic layer with streaming data to enhance efficiency and data processing.
  2. Streaming data warehouses handle storage differently than batch data warehouses, keeping fresh data in-memory and reducing compute cost.
  3. The semantic layer abstracts entities, attributes, and metrics, aiding in managing and optimizing queries on streaming data.
Get a weekly roundup of the best Substack posts, by hacker news affinity:
2 HN points 07 Mar 24
  1. Text-to-semantic layer systems can work in enterprise but text-to-SQL ones won't due to technical deficiencies.
  2. Even with infinite resources, achieving a perfect text-to-SQL system may not be enough due to the importance of how data is perceived by stakeholders.
  3. Blame and humiliation dynamics in human interactions make text-to-semantic layer systems more viable than text-to-SQL systems in corporate settings.
0 implied HN points 17 Dec 24
  1. There's a new command called `sqlmesh cube_generate` that helps build models for data analysis. It's designed to make working with data easier for users.
  2. The tool outputs useful information in a structured format, which includes joins and fields for data analysis. This makes it simple to understand how the data connects.
  3. Even if there are challenges with complex data types, the output is still effective and can be enhanced using AI, showing there's room for creativity in data modeling.