The hottest Parallelism Substack posts right now

And their main takeaways
Category
Top Technology Topics
Software Bits Newsletter 257 implied HN points 29 Dec 25
  1. Associativity is the key property that lets you split work, combine partial results, and safely parallelize or stream computations without changing the answer.
  2. Softmax has a hidden associative state — tracking a local max and a scaled sum lets you correct and merge chunked results, which is the math behind FlashAttention’s memory- and time-saving trick.
  3. When optimizing a global computation, look for a small combinable state and an associative combine rule; if it exists you can chunk and parallelize, and if it doesn’t (for example, median) you need a different algorithmic approach.
SwirlAI Newsletter 314 implied HN points 06 Aug 23
  1. Choose the right file format for your data storage in Spark like Parquet or ORC for OLAP use cases.
  2. Understand and utilize encoding techniques like Run Length Encoding and Dictionary Encoding in Parquet for efficient data storage.
  3. Optimize Spark Executor Memory allocation and maximize the number of executors for improved application performance.
The AI Observer 3 implied HN points 14 Mar 24
  1. GPUs are essential for modern compute, especially with the rise of AI workloads like large language models that heavily rely on tensor operations like matrix addition and multiplication.
  2. When working with GPUs, programmers use CUDA to define functions called kernels that can be launched on the GPU. Parallelism is explicitly defined and optimized, unlike in CPUs where loops iterate serially over data sets.
  3. Execution on GPUs differs from CPUs due to the minimal cost of GPU hardware threads, efficient thread scheduling at the hardware layer, and the use of warps to execute parallel instructions. GPUs optimize for high throughput with many hardware threads, while CPUs focus on low latency for individual instructions.
Conserving CPU's cycles ... 0 implied HN points 21 May 24
  1. In MSSQL to PostgreSQL migrations, challenges like query slowdowns may arise, with some queries taking significantly longer to execute in PostgreSQL compared to MSSQL.
  2. Join algorithm selection and parallelism are two key advantages contributing to MSSQL's impressive query execution speed.
  3. Multi-clause selectivity estimation in MSSQL allows for more precise cardinality estimation in complex join queries, giving it an edge over PostgreSQL in certain scenarios.
Get a weekly roundup of the best Substack posts, by hacker news affinity: