| <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01//EN" | 
 |                       "http://www.w3.org/TR/html4/strict.dtd"> | 
 | <html> | 
 | <head> | 
 |   <title>System Library</title> | 
 |   <link rel="stylesheet" href="llvm.css" type="text/css"> | 
 | </head> | 
 | <body> | 
 |  | 
 | <div class="doc_title">System Library</div> | 
 | <ul> | 
 |   <li><a href="#abstract">Abstract</a></li> | 
 |   <li><a href="#requirements">Keeping LLVM Portable</a> | 
 |   <ol> | 
 |     <li><a href="#headers">Don't Include System Headers</a></li> | 
 |     <li><a href="#expose">Don't Expose System Headers</a></li> | 
 |     <li><a href="#c_headers">Allow Standard C Header Files</a></li> | 
 |     <li><a href="#cpp_headers">Allow Standard C++ Header Files</a></li> | 
 |     <li><a href="#highlev">High-Level Interface</a></li> | 
 |     <li><a href="#nofunc">No Exposed Functions</a></li> | 
 |     <li><a href="#nodata">No Exposed Data</a></li> | 
 |     <li><a href="#nodupl">No Duplicate Implementations</a></li> | 
 |     <li><a href="#nounused">No Unused Functionality</a></li> | 
 |     <li><a href="#virtuals">No Virtual Methods</a></li> | 
 |     <li><a href="#softerrors">Minimize Soft Errors</a></li> | 
 |     <li><a href="#throw">Throw Only std::string</a></li> | 
 |     <li><a href="#throw_spec">No throw() Specifications</a></li> | 
 |     <li><a href="#organization">Code Organization</a></li> | 
 |     <li><a href="#semantics">Consistent Semantics</a></li> | 
 |     <li><a href="#bug">Tracking Bugzilla Bug: 351</a></li> | 
 |   </ol></li> | 
 | </ul> | 
 |  | 
 | <div class="doc_author"> | 
 |   <p>Written by <a href="rspencer@x10sys.com">Reid Spencer</a></p> | 
 | </div> | 
 |  | 
 |  | 
 | <!-- *********************************************************************** --> | 
 | <div class="doc_section"><a name="abstract">Abstract</a></div> | 
 | <div class="doc_text"> | 
 |   <p>This document provides some details on LLVM's System Library, located in | 
 |   the source at <tt>lib/System</tt> and <tt>include/llvm/System</tt>. The | 
 |   library's purpose is to shield LLVM from the differences between operating | 
 |   systems for the few services LLVM needs from the operating system. Much of | 
 |   LLVM is written using portability features of standard C++. However, in a few | 
 |   areas, system dependent facilities are needed and the System Library is the | 
 |   wrapper around those system calls.</p> | 
 |   <p>By centralizing LLVM's use of operating system interfaces, we make it  | 
 |   possible for the LLVM tool chain and runtime libraries to be more easily  | 
 |   ported to new platforms since (theoretically) only <tt>lib/System</tt> needs  | 
 |   to be ported.  This library also unclutters the rest of LLVM from #ifdef use  | 
 |   and special cases for specific operating systems. Such uses are replaced  | 
 |   with simple calls to the interfaces provided in <tt>include/llvm/System</tt>. | 
 |   </p>  | 
 |   <p>Note that the System Library is not intended to be a complete operating  | 
 |   system wrapper (such as the Adaptive Communications Environment (ACE) or  | 
 |   Apache Portable Runtime (APR)), but only provides the functionality necessary | 
 |   to support LLVM. | 
 |   <p>The System Library was written by Reid Spencer who formulated the | 
 |   design based on similar work originating from the eXtensible Programming  | 
 |   System (XPS). Several people helped with the effort; especially, | 
 |   Jeff Cohen and Henrik Bach on the Win32 port.</p> | 
 | </div> | 
 |  | 
 | <!-- *********************************************************************** --> | 
 | <div class="doc_section"> | 
 |   <a name="requirements">Keeping LLVM Portable</a> | 
 | </div> | 
 | <div class="doc_text"> | 
 |   <p>In order to keep LLVM portable, LLVM developers should adhere to a set of | 
 |   portability rules associated with the System Library. Adherence to these rules | 
 |   should help the System Library achieve its goal of shielding LLVM from the | 
 |   variations in operating system interfaces and doing so efficiently.  The  | 
 |   following sections define the rules needed to fulfill this objective.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="headers">Don't Inlcude System Headers</a> | 
 | </div> | 
 | <div class="doc_text"> | 
 |   <p>Except in <tt>lib/System</tt>, no LLVM source code should directly | 
 |   <tt>#include</tt> a system header. Care has been taken to remove all such | 
 |   <tt>#includes</tt> from LLVM while <tt>lib/System</tt> was being | 
 |   developed.  Specifically this means that header files like "unistd.h",  | 
 |   "windows.h", "stdio.h", and "string.h" are forbidden to be included by LLVM  | 
 |   source code outside the implementation of <tt>lib/System</tt>.</p> | 
 |   <p>To obtain system-dependent functionality, existing interfaces to the system | 
 |   found in <tt>include/llvm/System</tt> should be used. If an appropriate  | 
 |   interface is not available, it should be added to <tt>include/llvm/System</tt> | 
 |   and implemented in <tt>lib/System</tt> for all supported platforms.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="expose">Don't Expose System Headers</a> | 
 | </div> | 
 | <div class="doc_text"> | 
 |   <p>The System Library must shield LLVM from <em>all</em> system headers. To  | 
 |   obtain system level functionality, LLVM source must  | 
 |   <tt>#include "llvm/System/Thing.h"</tt> and nothing else. This means that  | 
 |   <tt>Thing.h</tt> cannot expose any system header files. This protects LLVM  | 
 |   from accidentally using system specific functionality and only allows it | 
 |   via the <tt>lib/System</tt> interface.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="c_headers">Use Standard C Headers</a></div> | 
 | <div class="doc_text"> | 
 |   <p>The <em>standard</em> C headers (the ones beginning with "c") are allowed | 
 |   to be exposed through the <tt>lib/System</tt> interface. These headers and  | 
 |   the things they declare are considered to be platform agnostic. LLVM source  | 
 |   files may include them directly or obtain their inclusion through  | 
 |   <tt>lib/System</tt> interfaces.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="cpp_headers">Use Standard C++ Headers</a> | 
 | </div> | 
 | <div class="doc_text"> | 
 |   <p>The <em>standard</em> C++ headers from the standard C++ library and | 
 |   standard template library may be exposed through the <tt>lib/System</tt> | 
 |   interface. These headers and the things they declare are considered to be | 
 |   platform agnostic. LLVM source files may include them or obtain their | 
 |   inclusion through lib/System interfaces.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="highlev">High Level Interface</a></div> | 
 | <div class="doc_text"> | 
 |   <p>The entry points specified in the interface of lib/System must be aimed at  | 
 |   completing some reasonably high level task needed by LLVM. We do not want to | 
 |   simply wrap each operating system call. It would be preferable to wrap several | 
 |   operating system calls that are always used in conjunction with one another by | 
 |   LLVM.</p> | 
 |   <p>For example, consider what is needed to execute a program, wait for it to | 
 |   complete, and return its result code. On Unix, this involves the following | 
 |   operating system calls: <tt>getenv, fork, execve,</tt> and <tt>wait</tt>. The | 
 |   correct thing for lib/System to provide is a function, say | 
 |   <tt>ExecuteProgramAndWait</tt>, that implements the functionality completely. | 
 |   what we don't want is wrappers for the operating system calls involved.</p> | 
 |   <p>There must <em>not</em> be a one-to-one relationship between operating | 
 |   system calls and the System library's interface. Any such interface function | 
 |   will be suspicious.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="nounused">No Unused Functionality</a></div> | 
 | <div class="doc_text"> | 
 |   <p>There must be no functionality specified in the interface of lib/System  | 
 |   that isn't actually used by LLVM. We're not writing a general purpose | 
 |   operating system wrapper here, just enough to satisfy LLVM's needs. And, LLVM | 
 |   doesn't need much. This design goal aims to keep the lib/System interface | 
 |   small and understandable which should foster its actual use and adoption.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="nodupl">No Duplicate Implementations</a> | 
 | </div> | 
 | <div class="doc_text"> | 
 |   <p>The implementation of a function for a given platform must be written | 
 |   exactly once. This implies that it must be possible to apply a function's  | 
 |   implementation to multiple operating systems if those operating systems can | 
 |   share the same implementation. This rule applies to the set of operating | 
 |   systems supported for a given class of operating system (e.g. Unix, Win32). | 
 |   </p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="virtuals">No Virtual Methods</a></div> | 
 | <div class="doc_text"> | 
 |   <p>The System Library interfaces can be called quite frequently by LLVM. In | 
 |   order to make those calls as efficient as possible, we discourage the use of | 
 |   virtual methods. There is no need to use inheritance for implementation | 
 |   differences, it just adds complexity. The <tt>#include</tt> mechanism works | 
 |   just fine.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="nofunc">No Exposed Functions</a></div> | 
 | <div class="doc_text"> | 
 |   <p>Any functions defined by system libraries (i.e. not defined by lib/System)  | 
 |   must not be exposed through the lib/System interface, even if the header file  | 
 |   for that function is not exposed. This prevents inadvertent use of system | 
 |   specific functionality.</p> | 
 |   <p>For example, the <tt>stat</tt> system call is notorious for having | 
 |   variations in the data it provides. <tt>lib/System</tt> must not declare  | 
 |   <tt>stat</tt> nor allow it to be declared. Instead it should provide its own  | 
 |   interface to discovering information about files and directories. Those  | 
 |   interfaces may be implemented in terms of <tt>stat</tt> but that is strictly  | 
 |   an implementation detail. The interface provided by the System Library must | 
 |   be implemented on all platforms (even those without <tt>stat</tt>).</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="nodata">No Exposed Data</a></div> | 
 | <div class="doc_text"> | 
 |   <p>Any data defined by system libraries (i.e. not defined by lib/System) must | 
 |   not be exposed through the lib/System interface, even if the header file for | 
 |   that function is not exposed. As with functions, this prevents inadvertent use | 
 |   of data that might not exist on all platforms.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="softerrors">Minimize Soft Errors</a></div> | 
 | <div class="doc_text"> | 
 |   <p>Operating system interfaces will generally provide error results for every | 
 |   little thing that could go wrong. In almost all cases, you can divide these | 
 |   error results into two groups: normal/good/soft and abnormal/bad/hard. That | 
 |   is, some of the errors are simply information like "file not found",  | 
 |   "insufficient privileges", etc. while other errors are much harder like | 
 |   "out of space", "bad disk sector", or "system call interrupted". We'll call  | 
 |   the first group "<i>soft</i>" errors and the second group "<i>hard</i>"  | 
 |   errors.<p> | 
 |   <p>lib/System must always attempt to minimize soft errors and always just | 
 |   throw a std::string on hard errors. This is a design requirement because the | 
 |   minimization of soft errors can affect the granularity and the nature of the | 
 |   interface. In general, if you find that you're wanting to throw soft errors, | 
 |   you must review the granularity of the interface because it is likely you're | 
 |   trying to implement something that is too low level. The rule of thumb is to | 
 |   provide interface functions that <em>can't</em> fail, except when faced with  | 
 |   hard errors.</p> | 
 |   <p>For a trivial example, suppose we wanted to add an "OpenFileForWriting"  | 
 |   function. For many operating systems, if the file doesn't exist, attempting  | 
 |   to open the file will produce an error.  However, lib/System should not | 
 |   simply throw that error if it occurs because its a soft error. The problem | 
 |   is that the interface function, OpenFileForWriting is too low level. It should | 
 |   be OpenOrCreateFileForWriting. In the case of the soft "doesn't exist" error,  | 
 |   this function would just create it and then open it for writing.</p> | 
 |   <p>This design principle needs to be maintained in lib/System because it | 
 |   avoids the propagation of soft error handling throughout the rest of LLVM. | 
 |   Hard errors will generally just cause a termination for an LLVM tool so don't | 
 |   be bashful about throwing them.</p> | 
 |   <p>Rules of thumb:</p> | 
 |   <ol> | 
 |     <li>Don't throw soft errors, only hard errors.</li> | 
 |     <li>If you're tempted to throw a soft error, re-think the interface.</li> | 
 |     <li>Handle internally the most common normal/good/soft error conditions | 
 |     so the rest of LLVM doesn't have to.</li> | 
 |   </ol> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="throw">Throw Only std::string</a></div> | 
 | <div class="doc_text"> | 
 |   <p>If an error occurs that lib/System cannot handle, the only action taken by | 
 |   lib/System is to throw an instance of std:string. The contents of the string | 
 |   must explain both what happened and the context in which it happened. The | 
 |   format of the string should be a (possibly empty) list of contexts each  | 
 |   terminated with a : and a space, followed by the error message, optionally | 
 |   followed by a reason, and optionally followed by a suggestion.</p> | 
 |   <p>For example, failure to open a file named "foo" could result in a message | 
 |   like:</p> | 
 |   <ul><li>foo: Unable to open file because it doesn't exist."</li></ul> | 
 |   <p>The "foo:" part is the context. The "Unable to open file" part is the error | 
 |   message. The "because it doesn't exist." part is the reason. This message has | 
 |   no suggestion. Where possible, the implementation of lib/System should use | 
 |   operating system specific facilities for converting the error code returned by | 
 |   a system call into an error message. This will help to make the error message | 
 |   more familiar to users of that type of operating system.</p> | 
 |   <p>Note that this requirement precludes the throwing of any other exceptions. | 
 |   For example, various C++ standard library functions can cause exceptions to be | 
 |   thrown (e.g. out of memory situation). In all cases, if there is a possibility | 
 |   that non-string exceptions could be thrown, the lib/System library must ensure | 
 |   that the exceptions are translated to std::string form.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="throw_spec">No throw Specifications</a> | 
 | </div> | 
 | <div class="doc_text"> | 
 |   <p>None of the lib/System interface functions may be declared with C++  | 
 |   <tt>throw()</tt> specifications on them. This requirement makes sure that the | 
 |   compiler does not insert additional exception handling code into the interface | 
 |   functions. This is a performance consideration: lib/System functions are at | 
 |   the bottom of many call chains and as such can be frequently called. We | 
 |   need them to be as efficient as possible.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="organization">Code Organization</a></div> | 
 | <div class="doc_text"> | 
 |   <p>Implementations of the System Library interface are separated by their | 
 |   general class of operating system. Currently only Unix and Win32 classes are | 
 |   defined but more could be added for other operating system classifications. | 
 |   To distinguish which implementation to compile, the code in lib/System uses | 
 |   the LLVM_ON_UNIX and LLVM_ON_WIN32 #defines provided via configure through the | 
 |   llvm/Config/config.h file. Each source file in lib/System, after implementing | 
 |   the generic (operating system independent) functionality needs to include the | 
 |   correct implementation using a set of <tt>#if defined(LLVM_ON_XYZ)</tt>  | 
 |   directives. For example, if we had lib/System/File.cpp, we'd expect to see in | 
 |   that file:</p> | 
 |   <pre><tt> | 
 |   #if defined(LLVM_ON_UNIX) | 
 |   #include "Unix/File.cpp" | 
 |   #endif | 
 |   #if defined(LLVM_ON_WIN32) | 
 |   #include "Win32/File.cpp" | 
 |   #endif | 
 |   </tt></pre> | 
 |   <p>The implementation in lib/System/Unix/File.cpp should handle all Unix | 
 |   variants. The implementation in lib/System/Win32/File.cpp should handle all | 
 |   Win32 variants.  What this does is quickly differentiate the basic class of  | 
 |   operating system that will provide the implementation. The specific details | 
 |   for a given platform must still be determined through the use of | 
 |   <tt>#ifdef</tt>.</p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="semantics">Consistent Semantics</a></div> | 
 | <div class="doc_text"> | 
 |   <p>The implementation of a lib/System interface can vary drastically between | 
 |   platforms. That's okay as long as the end result of the interface function  | 
 |   is the same. For example, a function to create a directory is pretty straight | 
 |   forward on all operating system. System V IPC on the other hand isn't even | 
 |   supported on all platforms. Instead of "supporting" System V IPC, lib/System | 
 |   should provide an interface to the basic concept of inter-process  | 
 |   communications. The implementations might use System V IPC if that was  | 
 |   available or named pipes, or whatever gets the job done effectively for a  | 
 |   given operating system.  In all cases, the interface and the implementation  | 
 |   must be semantically consistent. </p> | 
 | </div> | 
 |  | 
 | <!-- ======================================================================= --> | 
 | <div class="doc_subsection"><a name="bug">Bug 351</a></div> | 
 | <div class="doc_text"> | 
 |   <p>See <a href="http://llvm.org/PR351">bug 351</a> | 
 |   for further details on the progress of this work</p> | 
 | </div> | 
 |  | 
 | <!-- *********************************************************************** --> | 
 |  | 
 | <hr> | 
 | <address> | 
 |   <a href="http://jigsaw.w3.org/css-validator/check/referer"><img | 
 |   src="http://jigsaw.w3.org/css-validator/images/vcss" alt="Valid CSS!"></a> | 
 |   <a href="http://validator.w3.org/check/referer"><img | 
 |   src="http://www.w3.org/Icons/valid-html401" alt="Valid HTML 4.01!"></a> | 
 |  | 
 |   <a href="mailto:rspencer@x10sys.com">Reid Spencer</a><br> | 
 |   <a href="http://llvm.org">LLVM Compiler Infrastructure</a><br> | 
 |   Last modified: $Date$ | 
 | </address> | 
 | </body> | 
 | </html> |