| <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" |
| "http://www.w3.org/TR/html4/strict.dtd"> |
| <html> |
| <head> |
| <title>Debugging JITed Code With GDB</title> |
| <link rel="stylesheet" href="llvm.css" type="text/css"> |
| </head> |
| <body> |
| |
| <div class="doc_title">Debugging JITed Code With GDB</div> |
| <ol> |
| <li><a href="#introduction">Introduction</a></li> |
| <li><a href="#quickstart">Quickstart</a></li> |
| <li><a href="#example">Example with clang and lli</a></li> |
| </ol> |
| <div class="doc_author">Written by Reid Kleckner</div> |
| |
| <!--=========================================================================--> |
| <div class="doc_section"><a name="introduction">Introduction</a></div> |
| <!--=========================================================================--> |
| <div class="doc_text"> |
| |
| <p>Without special runtime support, debugging dynamically generated code with |
| GDB (as well as most debuggers) can be quite painful. Debuggers generally read |
| debug information from the object file of the code, but for JITed code, there is |
| no such file to look for. |
| </p> |
| |
| <p>Depending on the architecture, this can impact the debugging experience in |
| different ways. For example, on most 32-bit x86 architectures, you can simply |
| compile with -fno-omit-framepointer for GCC and -fdisable-fp-elim for LLVM. |
| When GDB creates a backtrace, it can properly unwind the stack, but the stack |
| frames owned by JITed code have ??'s instead of the appropriate symbol name. |
| However, on Linux x86_64 in particular, GDB relies on the DWARF CFA debug |
| information to unwind the stack, so even if you compile your program to leave |
| the frame pointer untouched, GDB will usually be unable to unwind the stack past |
| any JITed code stack frames. |
| </p> |
| |
| <p>In order to communicate the necessary debug info to GDB, an interface for |
| registering JITed code with debuggers has been designed and implemented for |
| GDB and LLVM. At a high level, whenever LLVM generates new machine code, it |
| also generates an object file in memory containing the debug information. LLVM |
| then adds the object file to the global list of object files and calls a special |
| function (__jit_debug_register_code) marked noinline that GDB knows about. When |
| GDB attaches to a process, it puts a breakpoint in this function and loads all |
| of the object files in the global list. When LLVM calls the registration |
| function, GDB catches the breakpoint signal, loads the new object file from |
| LLVM's memory, and resumes the execution. In this way, GDB can get the |
| necessary debug information. |
| </p> |
| |
| <p>At the time of this writing, LLVM only supports architectures that use ELF |
| object files and it only generates symbols and DWARF CFA information. However, |
| it would be easy to add more information to the object file, so we don't need to |
| coordinate with GDB to get better debug information. |
| </p> |
| </div> |
| |
| <!--=========================================================================--> |
| <div class="doc_section"><a name="quickstart">Quickstart</a></div> |
| <!--=========================================================================--> |
| <div class="doc_text"> |
| |
| <p>In order to debug code JITed by LLVM, you need to install a recent version |
| of GDB. The interface was added on 2009-08-19, so you need a snapshot of GDB |
| more recent than that. Either download a snapshot of GDB or checkout CVS as |
| instructed <a href="http://www.gnu.org/software/gdb/current/">here</a>. Here |
| are the commands for doing a checkout and building the code: |
| </p> |
| |
| <pre class="doc_code"> |
| $ cvs -z 3 -d :pserver:anoncvs@sourceware.org:/cvs/src co gdb |
| $ mv src gdb # You probably don't want this checkout called "src". |
| $ cd gdb |
| $ ./configure --prefix="$GDB_INSTALL" |
| $ make |
| $ make install |
| </pre> |
| |
| <p>You can then use -jit-emit-debug in the LLVM command line arguments to enable |
| the interface. |
| </p> |
| </div> |
| |
| <!--=========================================================================--> |
| <div class="doc_section"><a name="example">Example with clang and lli</a></div> |
| <!--=========================================================================--> |
| <div class="doc_text"> |
| |
| <p>For example, consider debugging running lli on the following C code in |
| foo.c: |
| </p> |
| |
| <pre class="doc_code"> |
| #include <stdio.h> |
| |
| void foo() { |
| printf("%d\n", *(int*)NULL); // Crash here |
| } |
| |
| void bar() { |
| foo(); |
| } |
| |
| void baz() { |
| bar(); |
| } |
| |
| int main(int argc, char **argv) { |
| baz(); |
| } |
| </pre> |
| |
| <p>Here are the commands to run that application under GDB and print the stack |
| trace at the crash: |
| </p> |
| |
| <pre class="doc_code"> |
| # Compile foo.c to bitcode. You can use either clang or llvm-gcc with this |
| # command line. Both require -fexceptions, or the calls are all marked |
| # 'nounwind' which disables DWARF CFA info. |
| $ clang foo.c -fexceptions -emit-llvm -c -o foo.bc |
| |
| # Run foo.bc under lli with -jit-emit-debug. If you built lli in debug mode, |
| # -jit-emit-debug defaults to true. |
| $ $GDB_INSTALL/gdb --args lli -jit-emit-debug foo.bc |
| ... |
| |
| # Run the code. |
| (gdb) run |
| Starting program: /tmp/gdb/lli -jit-emit-debug foo.bc |
| [Thread debugging using libthread_db enabled] |
| |
| Program received signal SIGSEGV, Segmentation fault. |
| 0x00007ffff7f55164 in foo () |
| |
| # Print the backtrace, this time with symbols instead of ??. |
| (gdb) bt |
| #0 0x00007ffff7f55164 in foo () |
| #1 0x00007ffff7f550f9 in bar () |
| #2 0x00007ffff7f55099 in baz () |
| #3 0x00007ffff7f5502a in main () |
| #4 0x00000000007c0225 in llvm::JIT::runFunction(llvm::Function*, |
| std::vector<llvm::GenericValue, |
| std::allocator<llvm::GenericValue> > const&) () |
| #5 0x00000000007d6d98 in |
| llvm::ExecutionEngine::runFunctionAsMain(llvm::Function*, |
| std::vector<std::string, |
| std::allocator<std::string> > const&, char const* const*) () |
| #6 0x00000000004dab76 in main () |
| </pre> |
| </div> |
| |
| <p>As you can see, GDB can correctly unwind the stack and has the appropriate |
| function names. |
| </p> |
| |
| <!-- *********************************************************************** --> |
| <hr> |
| <address> |
| <a href="http://jigsaw.w3.org/css-validator/check/referer"><img |
| src="http://jigsaw.w3.org/css-validator/images/vcss-blue" alt="Valid CSS"></a> |
| <a href="http://validator.w3.org/check/referer"><img |
| src="http://www.w3.org/Icons/valid-html401-blue" alt="Valid HTML 4.01"></a> |
| <a href="mailto:reid.kleckner@gmail.com">Reid Kleckner</a><br> |
| <a href="http://llvm.org">The LLVM Compiler Infrastructure</a><br> |
| Last modified: $Date: 2009-01-01 23:10:51 -0800 (Thu, 01 Jan 2009) $ |
| </address> |
| </body> |
| </html> |