资 源 简 介
Overview
As parallel execution platforms continue to proliferate, there is a growing need for real-time introspection tools to provide insight into platform behavior for performance debugging, correctness checks, and to drive effective resource management schemes. To address this need, we present the _Lynx_ dynamic instrumentation system. Lynx provides the capability to write instrumentation routines that are (1) selective, instrumenting
only what is needed, (2) transparent, without changes to the applications’ source code, (3) customizable, and (4) efficient.
Lynx was originally implemented as a branch of GPU Ocelot, a framework that provides run-time code generation of CUDA programs for heterogeneous architectures. Lynx now exists as a stand-alone, PTX editing tool, encapsulating only the necessary Ocelot dependencies (namely, Oc