| |
| Implementation notes: |
| |
| This is a true OS/400 implementation, not a PASE implementation (for PASE, |
| use an AIX implementation). |
| |
| The biggest problem with OS/400 is EBCDIC. The current libxml2 implementation |
| uses UTF-8 internally. To ease encoding conversion between the calling |
| applications and libxml2, supplementary "convert and latch" functions are |
| provided (See below). To bind the EBCDIC OS/400 system calls and libxml2, |
| an ASCII run-time environment (QADRT) has been used and wrapper functions have |
| been designed. |
| |
| Other problems are: |
| - Source code line length: to be stored in DB2 members, source files may not |
| have lines longer than 100 characters. Some header and documentation files |
| have been modified accordingly. |
| - va_list dereferencing: the OS/400 implementation of va_list type is an array |
| but the compiler forbids explicit array dereferencing. Source files have |
| been updated accordingly. |
| - Depending on the compilation/execution environment, it is possible that |
| stdin/stdout/stderr are not associated with a file descriptor; as a side |
| effect, open() may return a file descriptor value 0, 1 or 2 that is NOT |
| a C standard file. Thus using such a number may be inaccurate. |
| - iconv_open() arguments: OS/400 uses non-standard encoding names and does not |
| support standard names. For this reason, a name wrapper has been designed. |
| - dlopen() (support for xmodule): the function and its corollaries are not |
| provided by the OS/400 library. However a local implementation is provided. |
| |
| |
| Compiling on OS/400: |
| |
| _ As a prerequisite, QADRT development environment must be installed. |
| _ Install the libxml2 source directory in IFS. |
| _ Enter shell (QSH) |
| _ Change current directory to the libxml2 installation directory |
| _ Change current directory to ./os400 |
| _ Edit file iniscript.sh. You may want to change tunable configuration |
| parameters, like debug info generation, optimisation level, listing option, |
| target library, zlib availability, etc. |
| _ Copy any file in the current directory to makelog (i.e.: |
| cp initscript.sh makelog): this is intended to create the makelog file with |
| an ASCII CCSID! |
| _ Enter the command "sh make.sh >makelog 2>&1' |
| _ Examine the makelog file to check for compilation errors. |
| |
| Leaving file initscript.sh unchanged, this will produce the following |
| OS/400 objects: |
| _ Library LIBXML2. All other objects will be stored in this library. |
| _ Modules for all libxml2 units, with full debug info and no code optimization. |
| _ Binding directory LIBXML2_A, to be used at calling program link time for |
| statically binding the modules (specify BNDSRVPGM(QADRTTS QGLDCLNT QGLDBRDR) |
| when creating a program using LIBXML2_A). |
| _ Service program LIBXML2. To be used at calling program run-time |
| when this program has dynamically bound libxml2 at link time. |
| _ Binding directory LIBXML2. To be used to dynamically bind libxml2 when |
| linking a calling program. |
| _ Source file LIBXML. It contains all the header members needed to compile a |
| C/C++ module using libxml2. |
| _ Standard and additional C/C++ libxml2 header members (possibly renamed) in |
| file LIBXML. |
| _ IFS directory /libxml2 with subdirectory include/libxml containing all |
| C/C++ header files for IFS-based compilation. |
| _ Source file LIBXMLRPG. It contains all the include members needed to compile a |
| ILE/RPG module/program using libxml2 (ILE/RPG binding). |
| _ ILE/RPG binding include members (possibly renamed) in file LIBXMLRPG. |
| _ IFS subdirectory /libxml2/include/libxmlrpg containing all ILE/RPG include |
| files for IFS-based compilation. |
| |
| |
| Renamed header files in DB2 members: |
| DB2 member names are limited to 10 characters, thus the following C/C++ |
| header members are renamed as: |
| parserInternals.h --> PARSERINTE |
| schemasInternals.h --> SCHEMASINT |
| xmlautomata.h --> XMLAUTOMAT |
| xmlschemastype.h --> SCHMTYPES |
| xpathInternals.h --> XPATHINTER |
| IFS header files are NOT renamed. |
| ILE/RPG headers are processed likewise. |
| |
| |
| Special programming consideration: |
| |
| QADRT being used, the following points must be considered: |
| _ If static binding is used, service program QADRTTS must be linked too. |
| _ The EBCDIC CCSID used by QADRT is 37 by default, NOT THE JOB'S CCSID. If |
| another EBCDIC CCSID is required, it must be set via a locale through a call |
| to setlocale_a (QADRT's setlocale() ASCII wrapper) with category LC_ALL or |
| LC_CTYPE, or by setting environment variable QADRT_ENV_LOCALE to the locale |
| object path before executing the program. |
| _ Always use *IFSIO or *IFS64IO to compile calling programs. |
| |
| |
| |
| Supplementary (non libxml2 standard) support procedures for OS/400. |
| |
| As cited above, there are some procedures to ease encoding conversion of |
| libxml2 function arguments and results: the mechanism is based on |
| dictionaries. The functions convert a string, latch the result in a dictionary |
| to ensure its persistence and return its address. It is the caller's |
| responsibility to clean the dictionary when it becomes too big or disappears. |
| |
| The procedures are: |
| |
| #include <libxml/transcode.h> |
| |
| const char * xmlTranscodeResult(const xmlChar * s, |
| const char * encoding, |
| xmlDictPtr * dict, |
| void (*freeproc)(const void *)); |
| |
| const xmlChar * xmlTranscodeString(const char * s, |
| const char * encoding, |
| xmlDictPtr * dict); |
| |
| const xmlChar * xmlTranscodeWString(const char * s, |
| const char * encoding, |
| xmlDictPtr * dict); |
| |
| const xmlChar * xmlTranscodeWString(const char * s, |
| const char * encoding, |
| xmlDictPtr * dict); |
| |
| where: |
| s is the string to translate. |
| encoding is the alternate character encoding. If null, the current job's |
| encoding (CCSID) is used. |
| dict is the address of the latching directory. If NULL, the procedure |
| functions as a simple non-latching encoding converter and |
| its result value should be freed by the caller. |
| freeproc is a procedure to release the original string, or NULL. |
| |
| xmlTranscodeResult() converts from UTF-8 to the given alternate encoding. |
| xmlTranscodeString() converts from the given 8-bit encoding to UTF-8 (note that |
| UTF-8 itself is considered as a 8-bit encoding). |
| xmlTranscodeWString() converts from the given 16-bit encoding to UTF-8. |
| xmlTranscodeHString() converts from the given 32-bit encoding to UTF-8. |
| |
| |
| To shorten statements using these functions, shorthands are defined: |
| |
| xmlTR for xmlTranscodeResult |
| xmlTS for xmlTranscodeString |
| xmlTW for xmlTranscodeWString |
| xmlTH for xmlTranscodeHstring |
| |
| These shorthands may be disabled by defining XML_NO_SHORT_NAMES before |
| libxml/transcode.h inclusion. |
| |
| A directory pointer must be preset to NULL before the first call using it to |
| one of the above procedure. |
| |
| To release a latching directory, use function |
| |
| void xmlZapDict(xmlDictPtr * dict); |
| |
| |
| Example: |
| |
| #include <libxml/transcode.h> |
| #include <libxml/tree.h> |
| |
| xmlDocPtr mySimpleXMLDoc(char * element, char * text) |
| { |
| xmlDocPtr doc; |
| xmlNodePtr node; |
| xmlDictPtr dict = NULL; |
| |
| /* element and text are encoded in the current job's encoding. */ |
| |
| doc = xmlNewDoc(); |
| xmlNewTextChild((xmlNodePtr) doc, NULL, xmlTS(element, NULL, |
| &dict), xmlTS(text, NULL, &dict)); |
| xmlZapDict(&dict); |
| return doc; |
| } |
| |
| |
| Additionally, a formatter into latched/dynamic storage is provided: |
| |
| const char * xmlVasprintf(xmlDictPtr * dict, |
| const char * encoding, |
| const xmlChar * fmt, |
| va_list args); |
| |
| |
| xmllint and xmlcatalog programs: |
| |
| These programs are fully implemented at the qshell level, with standard |
| command line options. Links to these are installed in sub-directory bin of |
| the IFS installation directory. |
| CL command interfaces to these programs are also provided with limited |
| support. In particular, interactive mode is not supported and argument count |
| and lengths are limited by the CL command syntax. |
| |
| |
| ILE/RPG binding: |
| |
| All standard types and procedures are provided. Since ILE/RPG does not |
| support macros, they have not been ported. However some of them are emulated |
| as functions: these are the more useful ones (xmlXPathNodeSetGetLength, |
| xmlXPathNodeSetItem, xmlXPathNodeSetIsEmpty, htmlDefaultSubelement, |
| htmlElementAllowedHereDesc, htmlRequiredAttrs) and the global/threaded |
| variables access macros. These variables can be read with function |
| get_xxx(void), where xxxx is the name of the variable; they may be set by |
| calling function set_xxxx(value), where value is of the same type as the |
| variable. |
| |
| The C va_list is not implemented as such in ILE/RPG. Functions implementing |
| va_list and associated methods are provided: |
| |
| /include "libxmlrpg/xmlstdarg" |
| |
| d xmlVaStart pr |
| d list like(xmlVaList) |
| d lastargaddr * value |
| d lastargsize 10u 0 value |
| |
| d xmlVaArg pr |
| d list like(xmlVaList) |
| d dest * value |
| d argsize 10i 0 value |
| |
| d xmlVaEnd pr |
| d list like(xmlVaList) |