The hottest Performance Tuning Substack posts right now

And their main takeaways
System Design Classroom 299 implied HN points 16 May 24
  1. Getting timeouts right matters. Wait too long and your system slows down; time out too quickly and you may abandon calls that would have succeeded.
  2. Circuit breakers help manage failures. They quickly stop requests to a failing service, allowing your system to recover faster.
  3. Bulkheads keep parts of your system separate. If one part fails, the others keep working, preventing a complete shutdown of the system.
The Tech Buffet 59 implied HN points 06 Nov 23
  1. You can index data differently from how you retrieve it: the text you match a query against doesn't have to be the text you return.
  2. One method is to break chunks of data into smaller parts, which makes the retrieved passages more relevant to what the user is looking for.
  3. Another is to index chunks by the questions they answer or by their summaries. This makes it easier to find the right content, even when a user's query is vague.
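The "index one thing, return another" idea above can be shown with a toy two-level index. This is a dependency-free sketch, not the post's implementation: the hand-written summaries stand in for LLM-generated ones, and naive keyword overlap stands in for embedding similarity.

```python
# Full chunks are what we want to return to the user.
chunks = {
    "doc1": "Long passage about tuning connection pool sizes ...",
    "doc2": "Long passage about choosing timeout values ...",
}

# What we *match* on: short questions each chunk answers (stand-ins for
# what an LLM or heuristic would generate), mapped to the parent chunk.
index = {
    "how big should a connection pool be": "doc1",
    "how do I pick a timeout": "doc2",
}

def retrieve(query):
    # Naive word-overlap scoring instead of embeddings, to stay
    # dependency-free; the two-level lookup is the point.
    q = set(query.lower().split())
    best = max(index, key=lambda k: len(q & set(k.split())))
    return chunks[index[best]]
```

Because the match happens against focused questions rather than the full chunk, a vague query can still land on the right parent document.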
Sonal’s Newsletter 19 implied HN points 29 Jul 23
  1. Performance tuning Snowpark on Snowflake can significantly reduce processing time, from half a day to half an hour.
  2. Utilizing the query profiler by Snowflake and making targeted optimizations can have a high impact on performance.
  3. Optimizations like converting UDTFs to UDFs, caching DataFrames, and using batch size annotations can further speed up Snowpark workflows.
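Two of those ideas, caching an intermediate result and processing rows in batches, can be illustrated in plain Python. This is a generic sketch of the pattern, not the Snowpark API; the function names and the call counter are invented for illustration.

```python
# Counter to show how often the "expensive" step actually runs.
calls = {"transform": 0}

def transform_batch(batch):
    # One invocation handles a whole batch, so fixed per-call overhead
    # is paid per batch rather than per row (the "batch size" idea).
    calls["transform"] += 1
    return [x * 2 for x in batch]

def process(rows, batch_size=100):
    out = []
    for i in range(0, len(rows), batch_size):
        out.extend(transform_batch(rows[i:i + batch_size]))
    # Materialize `out` once and let every downstream consumer reuse it,
    # instead of recomputing the transform (the "cache" idea).
    return out
```

In Snowpark the same levers are the vectorized UDF batch size and caching an intermediate DataFrame, but the win comes from the same two mechanisms shown here.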
Thái | Hacker | Kỹ sư tin tặc 0 implied HN points 26 Jul 07
  1. Having 'hàng khủng' (powerful hardware) isn't always good. Used inappropriately, it can cause system issues such as overloading.
  2. Hugepages in Linux provide benefits by reducing page table lookup time and ensuring non-swappable memory for important services.
  3. Understanding the tools and resources you are using thoroughly is crucial to avoid unintended consequences.
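On Linux, the hugepage counters mentioned in takeaway 2 are visible in `/proc/meminfo`. A small sketch of checking them; the parser takes the file's text as a string so it can be exercised anywhere, and on a real system you would pass it `open("/proc/meminfo").read()`.

```python
def hugepage_info(meminfo_text):
    """Extract hugepage-related counters (HugePages_Total, HugePages_Free,
    Hugepagesize, ...) from /proc/meminfo-formatted text."""
    info = {}
    for line in meminfo_text.splitlines():
        if line.startswith(("HugePages_", "Hugepagesize")):
            key, _, value = line.partition(":")
            info[key.strip()] = value.strip()
    return info
```

`HugePages_Total` at zero means no pages are reserved; since hugepages are pinned and non-swappable, reserving too many starves the rest of the system, which is exactly the "powerful resources misused" failure mode the post describes.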
DataSketch’s Substack 0 implied HN points 14 Oct 24
  1. Properly configuring resources in Spark is really important. Make sure you adjust settings like memory and cores to fit your cluster's total resources.
  2. Good data partitioning helps Spark job performance a lot. For example, repartitioning your data based on a relevant column can lead to faster processing times.
  3. Using broadcast joins can save time and reduce workload. When joining smaller tables, broadcasting can make the process much quicker.
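The broadcast-join idea in takeaway 3 can be sketched without Spark. This is a plain-Python illustration of the mechanism, not the PySpark API: the small table is shipped to every worker as a hash map, so each partition of the big table is joined with a local lookup instead of a shuffle.

```python
# Small dimension table: category_id -> category name.
small_table = {1: "electronics", 2: "books"}

def join_partition(rows, broadcast_map):
    # Each worker runs this on its own partition of the big table;
    # no data movement is needed because the map is available locally.
    return [(row_id, broadcast_map.get(cat)) for row_id, cat in rows]

# Big fact table, split into partitions as Spark would hold it.
big_partitions = [
    [(10, 1), (11, 2)],
    [(12, 1)],
]
joined = [pair
          for part in big_partitions
          for pair in join_partition(part, small_table)]
```

In PySpark the equivalent is hinting the join with `pyspark.sql.functions.broadcast(small_df)`, which tells the planner to replicate the small side rather than shuffle both sides.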