449 lines
18 KiB
Plaintext
449 lines
18 KiB
Plaintext
|
Compiling PCRE on non-Unix systems
|
||
|
----------------------------------
|
||
|
|
||
|
This document contains the following sections:
|
||
|
|
||
|
General
|
||
|
Generic instructions for the PCRE C library
|
||
|
The C++ wrapper functions
|
||
|
Building for virtual Pascal
|
||
|
Stack size in Windows environments
|
||
|
Linking programs in Windows environments
|
||
|
Comments about Win32 builds
|
||
|
Building PCRE on Windows with CMake
|
||
|
Use of relative paths with CMake on Windows
|
||
|
Testing with runtest.bat
|
||
|
Building under Windows with BCC5.5
|
||
|
Building PCRE on OpenVMS
|
||
|
|
||
|
|
||
|
GENERAL
|
||
|
|
||
|
I (Philip Hazel) have no experience of Windows or VMS sytems and how their
|
||
|
libraries work. The items in the PCRE distribution and Makefile that relate to
|
||
|
anything other than Unix-like systems are untested by me.
|
||
|
|
||
|
There are some other comments and files (including some documentation in CHM
|
||
|
format) in the Contrib directory on the FTP site:
|
||
|
|
||
|
ftp://ftp.csx.cam.ac.uk/pub/software/programming/pcre/Contrib
|
||
|
|
||
|
If you want to compile PCRE for a non-Unix system (especially for a system that
|
||
|
does not support "configure" and "make" files), note that the basic PCRE
|
||
|
library consists entirely of code written in Standard C, and so should compile
|
||
|
successfully on any system that has a Standard C compiler and library. The C++
|
||
|
wrapper functions are a separate issue (see below).
|
||
|
|
||
|
The PCRE distribution includes a "configure" file for use by the Configure/Make
|
||
|
build system, as found in many Unix-like environments. There is also support
|
||
|
support for CMake, which some users prefer, in particular in Windows
|
||
|
environments. There are some instructions for CMake under Windows in the
|
||
|
section entitled "Building PCRE with CMake" below. CMake can also be used to
|
||
|
build PCRE in Unix-like systems.
|
||
|
|
||
|
|
||
|
GENERIC INSTRUCTIONS FOR THE PCRE C LIBRARY
|
||
|
|
||
|
The following are generic comments about building the PCRE C library "by hand".
|
||
|
|
||
|
(1) Copy or rename the file config.h.generic as config.h, and edit the macro
|
||
|
settings that it contains to whatever is appropriate for your environment.
|
||
|
In particular, if you want to force a specific value for newline, you can
|
||
|
define the NEWLINE macro. When you compile any of the PCRE modules, you
|
||
|
must specify -DHAVE_CONFIG_H to your compiler so that config.h is included
|
||
|
in the sources.
|
||
|
|
||
|
An alternative approach is not to edit config.h, but to use -D on the
|
||
|
compiler command line to make any changes that you need to the
|
||
|
configuration options. In this case -DHAVE_CONFIG_H must not be set.
|
||
|
|
||
|
NOTE: There have been occasions when the way in which certain parameters
|
||
|
in config.h are used has changed between releases. (In the configure/make
|
||
|
world, this is handled automatically.) When upgrading to a new release,
|
||
|
you are strongly advised to review config.h.generic before re-using what
|
||
|
you had previously.
|
||
|
|
||
|
(2) Copy or rename the file pcre.h.generic as pcre.h.
|
||
|
|
||
|
(3) EITHER:
|
||
|
Copy or rename file pcre_chartables.c.dist as pcre_chartables.c.
|
||
|
|
||
|
OR:
|
||
|
Compile dftables.c as a stand-alone program (using -DHAVE_CONFIG_H if
|
||
|
you have set up config.h), and then run it with the single argument
|
||
|
"pcre_chartables.c". This generates a set of standard character tables
|
||
|
and writes them to that file. The tables are generated using the default
|
||
|
C locale for your system. If you want to use a locale that is specified
|
||
|
by LC_xxx environment variables, add the -L option to the dftables
|
||
|
command. You must use this method if you are building on a system that
|
||
|
uses EBCDIC code.
|
||
|
|
||
|
The tables in pcre_chartables.c are defaults. The caller of PCRE can
|
||
|
specify alternative tables at run time.
|
||
|
|
||
|
(4) Ensure that you have the following header files:
|
||
|
|
||
|
pcre_internal.h
|
||
|
ucp.h
|
||
|
|
||
|
(5) Also ensure that you have the following file, which is #included as source
|
||
|
when building a debugging version of PCRE, and is also used by pcretest.
|
||
|
|
||
|
pcre_printint.src
|
||
|
|
||
|
(6) Compile the following source files, setting -DHAVE_CONFIG_H as a compiler
|
||
|
option if you have set up config.h with your configuration, or else use
|
||
|
other -D settings to change the configuration as required.
|
||
|
|
||
|
pcre_chartables.c
|
||
|
pcre_compile.c
|
||
|
pcre_config.c
|
||
|
pcre_dfa_exec.c
|
||
|
pcre_exec.c
|
||
|
pcre_fullinfo.c
|
||
|
pcre_get.c
|
||
|
pcre_globals.c
|
||
|
pcre_info.c
|
||
|
pcre_maketables.c
|
||
|
pcre_newline.c
|
||
|
pcre_ord2utf8.c
|
||
|
pcre_refcount.c
|
||
|
pcre_study.c
|
||
|
pcre_tables.c
|
||
|
pcre_try_flipped.c
|
||
|
pcre_ucd.c
|
||
|
pcre_valid_utf8.c
|
||
|
pcre_version.c
|
||
|
pcre_xclass.c
|
||
|
|
||
|
Make sure that you include -I. in the compiler command (or equivalent for
|
||
|
an unusual compiler) so that all included PCRE header files are first
|
||
|
sought in the current directory. Otherwise you run the risk of picking up
|
||
|
a previously-installed file from somewhere else.
|
||
|
|
||
|
(7) Now link all the compiled code into an object library in whichever form
|
||
|
your system keeps such libraries. This is the basic PCRE C library. If
|
||
|
your system has static and shared libraries, you may have to do this once
|
||
|
for each type.
|
||
|
|
||
|
(8) Similarly, compile pcreposix.c (remembering -DHAVE_CONFIG_H if necessary)
|
||
|
and link the result (on its own) as the pcreposix library.
|
||
|
|
||
|
(9) Compile the test program pcretest.c (again, don't forget -DHAVE_CONFIG_H).
|
||
|
This needs the functions in the pcre and pcreposix libraries when linking.
|
||
|
It also needs the pcre_printint.src source file, which it #includes.
|
||
|
|
||
|
(10) Run pcretest on the testinput files in the testdata directory, and check
|
||
|
that the output matches the corresponding testoutput files. Note that the
|
||
|
supplied files are in Unix format, with just LF characters as line
|
||
|
terminators. You may need to edit them to change this if your system uses
|
||
|
a different convention. If you are using Windows, you probably should use
|
||
|
the wintestinput3 file instead of testinput3 (and the corresponding output
|
||
|
file). This is a locale test; wintestinput3 sets the locale to "french"
|
||
|
rather than "fr_FR", and there some minor output differences.
|
||
|
|
||
|
(11) If you want to use the pcregrep command, compile and link pcregrep.c; it
|
||
|
uses only the basic PCRE library (it does not need the pcreposix library).
|
||
|
|
||
|
|
||
|
THE C++ WRAPPER FUNCTIONS
|
||
|
|
||
|
The PCRE distribution also contains some C++ wrapper functions and tests,
|
||
|
contributed by Google Inc. On a system that can use "configure" and "make",
|
||
|
the functions are automatically built into a library called pcrecpp. It should
|
||
|
be straightforward to compile the .cc files manually on other systems. The
|
||
|
files called xxx_unittest.cc are test programs for each of the corresponding
|
||
|
xxx.cc files.
|
||
|
|
||
|
|
||
|
BUILDING FOR VIRTUAL PASCAL
|
||
|
|
||
|
A script for building PCRE using Borland's C++ compiler for use with VPASCAL
|
||
|
was contributed by Alexander Tokarev. Stefan Weber updated the script and added
|
||
|
additional files. The following files in the distribution are for building PCRE
|
||
|
for use with VP/Borland: makevp_c.txt, makevp_l.txt, makevp.bat, pcregexp.pas.
|
||
|
|
||
|
|
||
|
STACK SIZE IN WINDOWS ENVIRONMENTS
|
||
|
|
||
|
The default processor stack size of 1Mb in some Windows environments is too
|
||
|
small for matching patterns that need much recursion. In particular, test 2 may
|
||
|
fail because of this. Normally, running out of stack causes a crash, but there
|
||
|
have been cases where the test program has just died silently. See your linker
|
||
|
documentation for how to increase stack size if you experience problems. The
|
||
|
Linux default of 8Mb is a reasonable choice for the stack, though even that can
|
||
|
be too small for some pattern/subject combinations.
|
||
|
|
||
|
PCRE has a compile configuration option to disable the use of stack for
|
||
|
recursion so that heap is used instead. However, pattern matching is
|
||
|
significantly slower when this is done. There is more about stack usage in the
|
||
|
"pcrestack" documentation.
|
||
|
|
||
|
|
||
|
LINKING PROGRAMS IN WINDOWS ENVIRONMENTS
|
||
|
|
||
|
If you want to statically link a program against a PCRE library in the form of
|
||
|
a non-dll .a file, you must define PCRE_STATIC before including pcre.h,
|
||
|
otherwise the pcre_malloc() and pcre_free() exported functions will be declared
|
||
|
__declspec(dllimport), with unwanted results.
|
||
|
|
||
|
|
||
|
CALLING CONVENTIONS IN WINDOWS ENVIRONMENTS
|
||
|
|
||
|
It is possible to compile programs to use different calling conventions using
|
||
|
MSVC. Search the web for "calling conventions" for more information. To make it
|
||
|
easier to change the calling convention for the exported functions in the
|
||
|
PCRE library, the macro PCRE_CALL_CONVENTION is present in all the external
|
||
|
definitions. It can be set externally when compiling (e.g. in CFLAGS). If it is
|
||
|
not set, it defaults to empty; the default calling convention is then used
|
||
|
(which is what is wanted most of the time).
|
||
|
|
||
|
|
||
|
COMMENTS ABOUT WIN32 BUILDS (see also "BUILDING PCRE WITH CMAKE" below)
|
||
|
|
||
|
There are two ways of building PCRE using the "configure, make, make install"
|
||
|
paradigm on Windows systems: using MinGW or using Cygwin. These are not at all
|
||
|
the same thing; they are completely different from each other. There is also
|
||
|
support for building using CMake, which some users find a more straightforward
|
||
|
way of building PCRE under Windows. However, the tests are not run
|
||
|
automatically when CMake is used.
|
||
|
|
||
|
The MinGW home page (http://www.mingw.org/) says this:
|
||
|
|
||
|
MinGW: A collection of freely available and freely distributable Windows
|
||
|
specific header files and import libraries combined with GNU toolsets that
|
||
|
allow one to produce native Windows programs that do not rely on any
|
||
|
3rd-party C runtime DLLs.
|
||
|
|
||
|
The Cygwin home page (http://www.cygwin.com/) says this:
|
||
|
|
||
|
Cygwin is a Linux-like environment for Windows. It consists of two parts:
|
||
|
|
||
|
. A DLL (cygwin1.dll) which acts as a Linux API emulation layer providing
|
||
|
substantial Linux API functionality
|
||
|
|
||
|
. A collection of tools which provide Linux look and feel.
|
||
|
|
||
|
The Cygwin DLL currently works with all recent, commercially released x86 32
|
||
|
bit and 64 bit versions of Windows, with the exception of Windows CE.
|
||
|
|
||
|
On both MinGW and Cygwin, PCRE should build correctly using:
|
||
|
|
||
|
./configure && make && make install
|
||
|
|
||
|
This should create two libraries called libpcre and libpcreposix, and, if you
|
||
|
have enabled building the C++ wrapper, a third one called libpcrecpp. These are
|
||
|
independent libraries: when you like with libpcreposix or libpcrecpp you must
|
||
|
also link with libpcre, which contains the basic functions. (Some earlier
|
||
|
releases of PCRE included the basic libpcre functions in libpcreposix. This no
|
||
|
longer happens.)
|
||
|
|
||
|
A user submitted a special-purpose patch that makes it easy to create
|
||
|
"pcre.dll" under mingw32 using the "msys" environment. It provides "pcre.dll"
|
||
|
as a special target. If you use this target, no other files are built, and in
|
||
|
particular, the pcretest and pcregrep programs are not built. An example of how
|
||
|
this might be used is:
|
||
|
|
||
|
./configure --enable-utf --disable-cpp CFLAGS="-03 -s"; make pcre.dll
|
||
|
|
||
|
Using Cygwin's compiler generates libraries and executables that depend on
|
||
|
cygwin1.dll. If a library that is generated this way is distributed,
|
||
|
cygwin1.dll has to be distributed as well. Since cygwin1.dll is under the GPL
|
||
|
licence, this forces not only PCRE to be under the GPL, but also the entire
|
||
|
application. A distributor who wants to keep their own code proprietary must
|
||
|
purchase an appropriate Cygwin licence.
|
||
|
|
||
|
MinGW has no such restrictions. The MinGW compiler generates a library or
|
||
|
executable that can run standalone on Windows without any third party dll or
|
||
|
licensing issues.
|
||
|
|
||
|
But there is more complication:
|
||
|
|
||
|
If a Cygwin user uses the -mno-cygwin Cygwin gcc flag, what that really does is
|
||
|
to tell Cygwin's gcc to use the MinGW gcc. Cygwin's gcc is only acting as a
|
||
|
front end to MinGW's gcc (if you install Cygwin's gcc, you get both Cygwin's
|
||
|
gcc and MinGW's gcc). So, a user can:
|
||
|
|
||
|
. Build native binaries by using MinGW or by getting Cygwin and using
|
||
|
-mno-cygwin.
|
||
|
|
||
|
. Build binaries that depend on cygwin1.dll by using Cygwin with the normal
|
||
|
compiler flags.
|
||
|
|
||
|
The test files that are supplied with PCRE are in Unix format, with LF
|
||
|
characters as line terminators. It may be necessary to change the line
|
||
|
terminators in order to get some of the tests to work. We hope to improve
|
||
|
things in this area in future.
|
||
|
|
||
|
|
||
|
BUILDING PCRE ON WINDOWS WITH CMAKE
|
||
|
|
||
|
CMake is an alternative build facility that can be used instead of the
|
||
|
traditional Unix "configure". CMake version 2.4.7 supports Borland makefiles,
|
||
|
MinGW makefiles, MSYS makefiles, NMake makefiles, UNIX makefiles, Visual Studio
|
||
|
6, Visual Studio 7, Visual Studio 8, and Watcom W8. The following instructions
|
||
|
were contributed by a PCRE user.
|
||
|
|
||
|
1. Download CMake 2.4.7 or above from http://www.cmake.org/, install and ensure
|
||
|
that cmake\bin is on your path.
|
||
|
|
||
|
2. Unzip (retaining folder structure) the PCRE source tree into a source
|
||
|
directory such as C:\pcre.
|
||
|
|
||
|
3. Create a new, empty build directory: C:\pcre\build\
|
||
|
|
||
|
4. Run CMakeSetup from the Shell envirornment of your build tool, e.g., Msys
|
||
|
for Msys/MinGW or Visual Studio Command Prompt for VC/VC++
|
||
|
|
||
|
5. Enter C:\pcre\pcre-xx and C:\pcre\build for the source and build
|
||
|
directories, respectively
|
||
|
|
||
|
6. Hit the "Configure" button.
|
||
|
|
||
|
7. Select the particular IDE / build tool that you are using (Visual Studio,
|
||
|
MSYS makefiles, MinGW makefiles, etc.)
|
||
|
|
||
|
8. The GUI will then list several configuration options. This is where you can
|
||
|
enable UTF-8 support, etc.
|
||
|
|
||
|
9. Hit "Configure" again. The adjacent "OK" button should now be active.
|
||
|
|
||
|
10. Hit "OK".
|
||
|
|
||
|
11. The build directory should now contain a usable build system, be it a
|
||
|
solution file for Visual Studio, makefiles for MinGW, etc.
|
||
|
|
||
|
|
||
|
USE OF RELATIVE PATHS WITH CMAKE ON WINDOWS
|
||
|
|
||
|
A PCRE user comments as follows:
|
||
|
|
||
|
I thought that others may want to know the current state of
|
||
|
CMAKE_USE_RELATIVE_PATHS support on Windows.
|
||
|
|
||
|
Here it is:
|
||
|
-- AdditionalIncludeDirectories is only partially modified (only the
|
||
|
first path - see below)
|
||
|
-- Only some of the contained file paths are modified - shown below for
|
||
|
pcre.vcproj
|
||
|
-- It properly modifies
|
||
|
|
||
|
I am sure CMake people can fix that if they want to. Until then one will
|
||
|
need to replace existing absolute paths in project files with relative
|
||
|
paths manually (e.g. from VS) - relative to project file location. I did
|
||
|
just that before being told to try CMAKE_USE_RELATIVE_PATHS. Not a big
|
||
|
deal.
|
||
|
|
||
|
AdditionalIncludeDirectories="E:\builds\pcre\build;E:\builds\pcre\pcre-7.5;"
|
||
|
AdditionalIncludeDirectories=".;E:\builds\pcre\pcre-7.5;"
|
||
|
|
||
|
RelativePath="pcre.h">
|
||
|
RelativePath="pcre_chartables.c">
|
||
|
RelativePath="pcre_chartables.c.rule">
|
||
|
|
||
|
|
||
|
TESTING WITH RUNTEST.BAT
|
||
|
|
||
|
1. Copy RunTest.bat into the directory where pcretest.exe has been created.
|
||
|
|
||
|
2. Edit RunTest.bat and insert a line that indentifies the relative location of
|
||
|
the pcre source, e.g.:
|
||
|
|
||
|
set srcdir=..\pcre-7.4-RC3
|
||
|
|
||
|
3. Run RunTest.bat from a command shell environment. Test outputs will
|
||
|
automatically be compared to expected results, and discrepancies will
|
||
|
identified in the console output.
|
||
|
|
||
|
4. To test pcrecpp, run pcrecpp_unittest.exe, pcre_stringpiece_unittest.exe and
|
||
|
pcre_scanner_unittest.exe.
|
||
|
|
||
|
|
||
|
BUILDING UNDER WINDOWS WITH BCC5.5
|
||
|
|
||
|
Michael Roy sent these comments about building PCRE under Windows with BCC5.5:
|
||
|
|
||
|
Some of the core BCC libraries have a version of PCRE from 1998 built in,
|
||
|
which can lead to pcre_exec() giving an erroneous PCRE_ERROR_NULL from a
|
||
|
version mismatch. I'm including an easy workaround below, if you'd like to
|
||
|
include it in the non-unix instructions:
|
||
|
|
||
|
When linking a project with BCC5.5, pcre.lib must be included before any of
|
||
|
the libraries cw32.lib, cw32i.lib, cw32mt.lib, and cw32mti.lib on the command
|
||
|
line.
|
||
|
|
||
|
|
||
|
BUILDING UNDER WINDOWS CE WITH VISUAL STUDIO 200x
|
||
|
|
||
|
Vincent Richomme sent a zip archive of files to help with this process. They
|
||
|
can be found in the file "pcre-vsbuild.zip" in the Contrib directory of the FTP
|
||
|
site.
|
||
|
|
||
|
|
||
|
BUILDING PCRE ON OPENVMS
|
||
|
|
||
|
Dan Mooney sent the following comments about building PCRE on OpenVMS. They
|
||
|
relate to an older version of PCRE that used fewer source files, so the exact
|
||
|
commands will need changing. See the current list of source files above.
|
||
|
|
||
|
"It was quite easy to compile and link the library. I don't have a formal
|
||
|
make file but the attached file [reproduced below] contains the OpenVMS DCL
|
||
|
commands I used to build the library. I had to add #define
|
||
|
POSIX_MALLOC_THRESHOLD 10 to pcre.h since it was not defined anywhere.
|
||
|
|
||
|
The library was built on:
|
||
|
O/S: HP OpenVMS v7.3-1
|
||
|
Compiler: Compaq C v6.5-001-48BCD
|
||
|
Linker: vA13-01
|
||
|
|
||
|
The test results did not match 100% due to the issues you mention in your
|
||
|
documentation regarding isprint(), iscntrl(), isgraph() and ispunct(). I
|
||
|
modified some of the character tables temporarily and was able to get the
|
||
|
results to match. Tests using the fr locale did not match since I don't have
|
||
|
that locale loaded. The study size was always reported to be 3 less than the
|
||
|
value in the standard test output files."
|
||
|
|
||
|
=========================
|
||
|
$! This DCL procedure builds PCRE on OpenVMS
|
||
|
$!
|
||
|
$! I followed the instructions in the non-unix-use file in the distribution.
|
||
|
$!
|
||
|
$ COMPILE == "CC/LIST/NOMEMBER_ALIGNMENT/PREFIX_LIBRARY_ENTRIES=ALL_ENTRIES
|
||
|
$ COMPILE DFTABLES.C
|
||
|
$ LINK/EXE=DFTABLES.EXE DFTABLES.OBJ
|
||
|
$ RUN DFTABLES.EXE/OUTPUT=CHARTABLES.C
|
||
|
$ COMPILE MAKETABLES.C
|
||
|
$ COMPILE GET.C
|
||
|
$ COMPILE STUDY.C
|
||
|
$! I had to set POSIX_MALLOC_THRESHOLD to 10 in PCRE.H since the symbol
|
||
|
$! did not seem to be defined anywhere.
|
||
|
$! I edited pcre.h and added #DEFINE SUPPORT_UTF8 to enable UTF8 support.
|
||
|
$ COMPILE PCRE.C
|
||
|
$ LIB/CREATE PCRE MAKETABLES.OBJ, GET.OBJ, STUDY.OBJ, PCRE.OBJ
|
||
|
$! I had to set POSIX_MALLOC_THRESHOLD to 10 in PCRE.H since the symbol
|
||
|
$! did not seem to be defined anywhere.
|
||
|
$ COMPILE PCREPOSIX.C
|
||
|
$ LIB/CREATE PCREPOSIX PCREPOSIX.OBJ
|
||
|
$ COMPILE PCRETEST.C
|
||
|
$ LINK/EXE=PCRETEST.EXE PCRETEST.OBJ, PCRE/LIB, PCREPOSIX/LIB
|
||
|
$! C programs that want access to command line arguments must be
|
||
|
$! defined as a symbol
|
||
|
$ PCRETEST :== "$ SYS$ROADSUSERS:[DMOONEY.REGEXP]PCRETEST.EXE"
|
||
|
$! Arguments must be enclosed in quotes.
|
||
|
$ PCRETEST "-C"
|
||
|
$! Test results:
|
||
|
$!
|
||
|
$! The test results did not match 100%. The functions isprint(), iscntrl(),
|
||
|
$! isgraph() and ispunct() on OpenVMS must not produce the same results
|
||
|
$! as the system that built the test output files provided with the
|
||
|
$! distribution.
|
||
|
$!
|
||
|
$! The study size did not match and was always 3 less on OpenVMS.
|
||
|
$!
|
||
|
$! Locale could not be set to fr
|
||
|
$!
|
||
|
=========================
|
||
|
|
||
|
Last Updated: 17 March 2009
|
||
|
****
|