About mzc

1.1  mzc Is...

The mzc compiler takes MzScheme (or MrEd) source code and produces either platform-independent byte-code compiled files (.zo files) or platform-specific native-code libraries (.so or .dll files) to be loaded into MzScheme (or MrEd). In the latter mode, mzc provides limited support for interfacing directly to C libraries.

mzc works on either individual files or on collections. (A collection is a group of files that conform to MzScheme's library collection system; see section 16 in PLT MzScheme: Language Manual). In general, mzc works best with code using the module form.

As a convenience for programmers writing low-level MzScheme extensions, mzc can compile and link plain C files that use MzScheme's escheme.h header. This facility is described in Inside PLT MzScheme.

Finally, mzc can perform miscellaneous tasks, such as embedding Scheme code in a copy of the MzScheme (or MrEd) binary to produce a stand-alone executable, or creating .plt distribution archives.

1.1.1  Byte-Code Compilation

A byte-code file typically uses the file extension .zo. The file starts with #~ followed by the byte-code data.

Byte-code files are loaded into MzScheme in the same way as regular Scheme source files (e.g., with load). The #~ marker causes MzScheme's reader to load byte codes instead of normal Scheme expressions. When a .zo file exists in a compiled subdirectory, it is sometimes loaded in place of a source file; see section 3.3 for details.

Byte-code programs produced by mzc run exactly the same as source code compiled by MzScheme directly (assuming the same set of bindings are in place at compile time and load time). In other words, byte-code compilation does not optimize the code any more than MzScheme's normal evaluator. However, a byte-code file can be loaded into MzScheme much faster than a source-code file.

1.1.2  Native-Code Compilation

A native-code file is a platform-specific shared library. Under Windows, native-code files use the extension .dll. Under Mac OS X, native-code files use the extension .dylib. Under Unix, native-code files use the extension .so.

Native-code files are loaded into MzScheme with the load-extension procedure (see section 14.4 in PLT MzScheme: Language Manual). When a native-code file exists in a compiled subdirectory, it is sometimes loaded in place of a source file; see section 3.3 for details.

The native-code compiler attempts to optimize a source program so that it runs faster than the source-code or byte-code version of the program. See section 1.4 for information on obtaining the best possible performance from mzc-compiled programs.

The cffi.ss library of the compiler collection defines Scheme forms, such as c-lambda, for accessing C functions from Scheme. The forms produce run-time errors when interpreted directly or compiled to byte code. See section 2 for further information.

Native-code compilation produces C source code in an intermediate stage; your system must provide an external C compiler to produce native code. The mzc compiler cannot produce native code directly from Scheme code.

The C compiler and compiler flags used by mzc can be adjusted via command line flags.

1.2  mzc Is Not...

mzc does not generally produce stand-alone executables from Scheme source code. The compiler's output is intended to be loaded into MzScheme (or MrEd or DrScheme). However, see also section 5 for information about embedding code into a copy of the MzScheme (or MrEd) executable.

mzc does not translate Scheme code into similar C code. Native-code compilation produces C code that relies on MzScheme to provide run-time support, which includes memory management, closure creation, procedure application, and primitive operations.

1.3  Running mzc

Run mzc from a shell, passing in flags and arguments on the command line.

In this manual, each example command line is shown as follows:

mzc --extension --prefix macros.ss file.ss

To run this example, type the command line into a shell (replacing mzc with the path to mzc on your system, if necessary).

Simple on-line help is available for mzc's command-line arguments by running mzc with the -h or --help flag.

1.4  Native Code Optimization from mzc

Compiling a program to native code with mzc can provide significant speedups compared to interpreting byte code (or running the program directly from source code), but only for certain kinds of programs. The speedup from native-code compilation is typically due to two optimizations:

Programs that permit these optimizations also to encourage a host of other optimizations, such as procedure inlining (for programmer-defined procedures) and static closure detection. In general, module-based programs provide the most opportunities for optimization.

Native-code compilation rarely produces significant speedup for programs that are not loop-intensive, programs that are heavily object-oriented, programs that are allocation-intensive, or programs that exploit built-in procedures (e.g., list operations, regular expression matching, or file manipulations) to perform most of the program's work.


1 The compiler cannot always prove that module definitions have been evaluated before the corresponding variable is used in an expression. Use the -v or --verbose flag to check whether mzc reports a ``last known module binding'' warning when compiling a module expression, which indicates that definitions after a particular line in the source file might be referenced before they are defined.