
Blackfire Continuous Profiler
Low-overhead continuous profiling for app performance optimization.
Discover top open-source software, updated regularly with real-world adoption signals.

Nanosecond-resolution, real-time telemetry profiler for games and apps
A real‑time, nanosecond‑resolution hybrid frame and sampling profiler supporting CPU, GPU, memory, locks, and context switches across C, C++, Lua, Python, Fortran and many more languages.

Tracy is a high‑precision profiler that captures performance data with nanosecond granularity. It combines frame‑based and sampling techniques, allowing developers to see both per‑frame timing and statistical distributions of code execution.
The tool instruments CPU code (native C/C++, Lua, Python, Fortran, etc.) and a wide range of GPU APIs—including OpenGL, Vulkan, Direct3D 11/12, Metal, OpenCL, and CUDA. It also tracks memory allocations, lock contention, and context switches, automatically linking screenshots to captured frames for visual correlation.
Tracy runs as a separate server process that receives telemetry over the network. Instrumented applications embed lightweight macros or API calls, then stream data to the Tracy UI client, which is available for Windows, Linux, and macOS. Pre‑built Windows x64 binaries are provided, and the source can be built with CMake or Meson for other platforms.
When teams consider Tracy Profiler, these hosted platforms usually appear on the same shortlist.
Looking for a hosted option? These are the services engineering teams benchmark against before choosing open source.
Frame‑by‑frame performance debugging in a game engine
Identify spikes per frame, correlate with GPU API calls, and reduce latency.
GPU kernel profiling for Vulkan compute workloads
Measure kernel execution time, memory transfers, and pinpoint bottlenecks.
Memory allocation tracking in a scientific Fortran application
Visualize allocation patterns, detect leaks, and optimize memory usage.
Cross‑language profiling of a Python‑driven simulation with C++ core
Combine Python and native call stacks to see end‑to‑end performance.
Only minimal instrumentation macros are needed; you add them where you want timing data.
Runs on Windows, Linux, and macOS, and supports OpenGL, Vulkan, Direct3D, Metal, CUDA, and OpenCL.
The profiler runs as a separate server process; instrumented applications send data over TCP/UDP to the UI client.
Yes, Tracy provides a cross‑platform graphical client that displays timelines, flame graphs, and resource usage.
Community bindings exist for many languages, and you can write custom wrappers using the C API.
Project at a glance
ActiveLast synced 4 days ago