Performance
- Will adding K8S CPU limit reduce service performance? (www.sobyte.net)
Explore whether adding the K8S CPU limit will degrade service performance.
- Exploring How Cache Memory Really Works (pikuma.com)
Even though we often hear terms like L1, L2, cache block size, etc., most programmers have a limited understanding of what cache really is. This is a beginner-friendly primer on how cache works.
- Measure Your Maven Build · Maarten on IT (maarten.mulders.it)
This blog introduces three mechanisms to investigate the execution time of a Maven build. Having a reliable way to measure build execution time can help identify bottlenecks. This in turn helps make effective improvements, thereby contributing to higher developer productivity. Find out how to effe...
- AI-Powered Flamegraph Interpreter in Grafana Pyroscope | Open Source Continuous Profiling Platform (pyroscope.io)
Explore how our AI-powered tool is revolutionizing flamegraph interpretation
- Using perf to profile Java applications | BellSoft Java (bell-sw.com)
Find out how to use perf, a built-in Linux profiler, to analyze the performance of Java applications.
- What would happen to low latency trading if exchanges moved to the cloud?
ULL trading firms go to a lot of trouble to get their servers and switches within the same buildings as the exchanges they trade with to reduce latency. Some firms don’t even use layer 1 switches to be competitive.
- No Restarts, No Disruptions: Seamless Pod Resource updates with In-Place Resizing (engineering.doit.com)
Updating Kubernetes pod resources in-place is a new way to optimize your k8s cluster without disruptions or restarts!
- Predictive CPU isolation of containers at Netflix (netflixtechblog.com)
By Benoit Rostykus, Gabriel Hartmann
- LLaMA Now Goes Faster on CPUs (justine.lol)
I wrote 84 new matmul kernels to improve llamafile CPU performance.
My kernels go 2x faster than MKL for matrices that fit in L2 cache, which makes them a work in progress, since the speedup works best for prompts having fewer than 1,000 tokens.
- Bump Allocation: Up or Down? (coredumped.dev)
Back in 2019, Nick Fitzgerald published "Always Bump Downwards", an article making the case that for bump allocators, bumping “down” (towards lower addresses) is better than bumping up. The biggest reasons for this are that bumping up requires 3 branches vs 2 for bumping down and rounding down requires few...
- More powerful Go execution traces - The Go Programming Language (go.dev)
New features and improvements to execution traces from the last year.
- Improving Network Performance with Linux Flowtables (www.ubicloud.com)
We’re building an open source alternative to AWS. For IPv4 assignment and firewall rules, we use Linux’s Netfilter / Nftables. This subsystem provides a powerful way to handle packets addressed to the host. We recently came across flowtables - a network acceleration feature in the Linux kernel that ...
- Perf is not enough (motherduck.com)
Are database benchmarks still relevant? Let's understand why they're a poor way to choose a database.
- Grafana Beyla 1.2 release: eBPF auto-instrumentation with full Kubernetes support | Grafana Labs (grafana.com)
Grafana Beyla 1.2 offers improved Kubernetes support, including the ability to decorate metrics and traces with the metadata of Kubernetes pods and nodes.
- UUID Benchmark War (ardentperf.com)
This month’s PGSQL Phriday #015 topic is about UUIDs, hosted by Lætitia Avrot. Lætitia has called for a debate. No, no, no. I say let’s have an all-out war. A benchmark war. I have deci…
- JDK HTTP server handles 100,000 req/sec with 100 ms start-up time and 50 MB modular run-time image. Built with OpenJDK 21 and virtual threads (github.com, ebarlas/java-httpserver-vthreads)
Benchmarks for JDK HTTP Server running on Java 21 with Virtual Threads.
- Bloom filter calculator (hur.st)
Calculate the optimal size for your bloom filter, see how many items a given filter can hold, or just admire the curvy graphs. Also borrow my MIT licensed Javascript for your own programs.
- Getting started with Web Performance 🚀 - HTMHell (www.htmhell.dev)
A collection of bad practices in HTML, copied from real websites.
- Accurate CPU sampling without perf_events · Issue #855 · async-profiler/async-profiler (github.com)
Background: Async-profiler has two options for CPU profiling: -e cpu and -e itimer. -e cpu (default) relies on perf_events, while -e itimer is based on setitimer. Perf events typically yield more acc...
- New in Grafana k6: The latest OSS features in v0.47.0 and more efficient performance testing in Grafana Cloud k6 | Grafana Labs (grafana.com)
Grafana k6 v0.47.0 is here, introducing binary metadata support in the k6 gRPC module, Docker images for ARM64 architecture, and other exciting new features.
- PostgreSQL JDBC Statement Caching - Vlad Mihalcea (vladmihalcea.com)
Learn how the PostgreSQL JDBC Driver provides Statement Caching and what settings you need to configure in order to optimize performance.
- JavaZone 2023 presentation: Cracking the scalability wall - by Sanne Grinovero (2023.javazone.no)
In this talk, we’ll discuss a severe JVM performance issue, the methodology leading to its discovery, how it affects the whole Java ecosystem, and what you could do to avoid it in your code. This scalability bottleneck has existed in OpenJDK for decades; the most surprising aspect is how such an im...
- Good performance is not just big O - Julio Merino (jmmv.dev)
Having a fast and responsive app is orthogonal to “knowing your big Os”. Unfortunately, most tech companies over-emphasize algorithms in interviews and downplay systems knowledge, and I believe that’s one reason behind sluggish apps and bloated systems. I’ve seen this play out repeatedly. Interviewe...
- FreeBSD (HEAD) no longer spends time running a bubblesort on its SYSINITs. We're now running a mergesort which is ~100x faster
https://news.ycombinator.com/item?id=37205053
> For people (like me) who are wondering why a kernel needs to boot in under 28ms: It's for virtual machines that get launched on-demand in services like AWS Lambda. https://www.daemonology.net/blog/2022-10-18-FreeBSD-Firecrac...
- simdjson/simdjson-java: A Java version of simdjson (github.com)
A Java version of simdjson - a JSON parser using SIMD instructions, based on the paper Parsing Gigabytes of JSON per Second by Geoff Langdale and Daniel Lemire.
- Memory Tuning a High Throughput Microservice (blog.flipkart.tech)
At Flipkart, over a billion recommendations are served to users every day. These recommendations help millions of users discover products…
- Caching Strategies and How to Choose the Right One (codeahoy.com)
Compare the pros and cons of various caching strategies to choose the best one for your use case.
- Myths Programmers Believe about CPU Caches (software.rajivprab.com)
As a computer engineer who has spent half a decade working with caches at Intel and Sun, I’ve learnt a thing or two about cache-coherency. This was one of the hardest concepts to learn back in coll…
- Hotspot performance engineering fails (lemire.me)
Developers often believe that software performance follows a Pareto distribution: 80% of the running time is spent in 20% of the code. Using this model, you can write most of your code without any care for performance and focus on the narrow pieces of code that are performance sensitive. Engineers l