GutenMark Download Page
Attractively formatting Project Gutenberg texts
home
features
download
usage
FAQ
changelog
buglist
links
developer

Contents

License
Downloading GutenMark
Installing GutenMark
Compiling GutenMark
Other Stuff You Might Want

License

GutenMark is freely available under the terms of the GNU General Public License (GPL).  You may view the text of the GPL here , or you may visit the Free Software Foundation for more explanation.


Downloading GutenMark

The "base package" is a single archive file containing all of the source code and executable programs for all directly supported platforms (Win32, Linux-Intel, Linux-PPC,  Mac OS X, and FreeBSD).  Simply choose the archive format (ZIP or TAR) most suitable for your particular platform.  The documentation is provided as a separate archive file, because of its size; the documentation is just a copy of this website (except possibly not as up-to-date), and therefore you may not want to download it.
Current Software
Description
Download software
Download documentation
Base package
  GutenMark-20011201.zip
  GutenMark-docs-20011201.zip
Base package
GutenMark-20011201.tar.gz 
GutenMark-docs-20011201.tar.gz
Configuration file:  Sorry!  I forgot to put it in the base package.
 GutenMark.cfg
n/a
Source only, for developers:  Sorry!  I failed to notice that the source code in the 20011201 snapshot was formatted unreadably.
src-20011205.tar.gz
n/a
Source only, for developers.  This is the current development version, and is probably more recent than the release version.
GutenMark_source_dev.tar.gz
n/a

The "wordlists" and "namelists" are optional files that you can download or not, as you choose.   The wordlists are categorized as highly recommended, recommended , or available, based on my own admittedly subjective experience.   Click here for an extended explanation of what wordlists do .

Wordlists and Namelists

Description
Version
Download
My own special English wordlist
Nov. 29, 2001
  1K
U.S. namelist
Nov. 10, 2001
348K
U.S. place names (NEW)
Dec. 18, 2001
144K
French namelist
 Nov. 11, 2001
7K
English wordlist
 Nov. 10, 2001
449K
French wordlist
 Nov. 17, 2001
373K
German wordlist
 Nov. 24, 2001
582K
Older, smaller German wordlist
Nov. 11, 2001
209K
Latin wordlist
 Nov. 16, 2001
195K
Italian wordlist
 Nov. 11, 2001
383K
Spanish wordlist
 Nov. 11, 2001
322K
Non-U.S. place names (NEW)
Dec. 22, 2001
5992K (Really really big!!)
Norwegian wordlist
 Nov. 16, 2001
2078K (Really big!)
Gaelic wordlist
 Nov. 11, 2001
298K
Danish wordlist
 Nov. 11, 2001
558K
Swedish wordlist
 Nov. 11, 2001
254K
Finnish wordlist
 Nov. 11, 2001
285K
 My own special non-English wordlist
Nov. 24, 2001
 1K
All wordlists and namelists
n/a
  via FTP

Older versions of the software

Description
Download
GutenMark base package + docs.  No wordlists needed.
  GutenMark-20011113.zip
GutenMark base package + docs.  No wordlists needed.
  GutenMark-20011113.tar.gz
GutenMark base package + docs.  No wordlists needed.
GutenMark-20011102.zip
GutenMark base package + docs.  No wordlists needed.
GutenMark-20011102.tar.gz
All software and documentation
via FTP


Installing GutenMark

... on Win32

  1. Unzip GutenMark-xxx.zip with WinZip, pkunzip, or whatever you have that's appropriate.  You'll find GutenMark.exe under the GutenMark\Win32 directory.
  2. Add this directory to your PATH, or copy GutenMark.exe to some directory that's already in your path.
  3. Place the file GutenMark.cfg and any wordlists or namelists into the directory from which you intend to run the GutenMark command-line program.  Typically, this is the same as the directory containing your etexts.
  4. Don't uncompress the wordlists.
  5. Depending on the wordlists you've downloaded, the native languages of the etexts you're interested in, and your own personal tastes, you may want to reconfigure the software.  In most cases, the default configuration should be fine.

... on Linux (Intel or PPC), Mac OS X, or FreeBSD

  1. gunzip GutenMark-xxx.tar.gz
  2. tar -xf GutenMark-xxx.tar
  3. Under the GutenMark directory will subdirectories LinuxIntel, LinuxPPC, MacOS-X, and FreeBSD.  Each subdirectory contains an executable program called "GutenMark" suitable for the indicated platform.   Add the appropriate directory to your path, or copy the executable to some directory that's already in your path.
  4. Place the file GutenMark.cfg and any wordlists or namelists into the directory from which you intend to run the GutenMark command-line program.  Typically, this is the same as the directory containing your etexts.
  5. Don't uncompress the wordlists.
  6. Depending on the wordlists you've downloaded, the native languages of the etexts you're interested in, and your own personal tastes, you may want to reconfigure the software.  In most cases, the default configuration should be fine.

Compiling GutenMark

If you don't have any of the platforms for which an executable program is supplied, or if you would like to modify the program, then you need to compile GutenMark yourself.  This is easy on any system that has the GNU compiler gcc and the GNU make program.  You can obtain gcc and make for free from GNU .  When compiling for Win32, the version of gcc called mingw32 (see Mumit Khan's web site ) is used.

NOTE:  In versions later than 20011113, support for Borland's free C++ compiler (see  Borland's web site ) has been dropped, because it was just too much effort for me without knowing if anyone was interested.  If for some reason you don't want to use mingw32, and if you figure out how to get other C compilers such as Borland's or Microsoft's to work, tell me ; I'll post the instructions here.

Requirements

You need to have the compression library zlib installed.  This can be obtained for free from www.zlib.org , but is already present on every *NIX system I personally have tried.  For Win32, I've included a pre-compiled zlib library with the GutenMark distribution, so you don't have to worry about it.

... on Win32

Unzip the source code, and change to the GutenMark/src from the DOS command line. To compile with mingw32,
make GutenMark.exe
(This assumes that the name of the GNU make program that you got with mingw32 is actually accessible by typing "make".  If this calls up some other make program, such as Microsoft's or Borland's, then this will not work properly.)  In addition to compiling GutenMark.exe, this will attempt to test the compilation by running GutenMark.exe to produce sample HTML file (bldhb10.html) which it compares to an equivalent HTML file (bldhb10.txt.html) provided with the distribution.  (For versions prior to 20011205, this comparison will fail because GutenMark embeds the compilation date within the HTML markup.  However, only a single line of the HTML file, albeit a long one, will fail the comparison.  If more than one HTML lines fail to match, then the executable is not working properly.)

... on *NIX

gunzip GutenMark-current.tar.gz
tar -xf GutenMark-current.tar
cd GutenMark/src
make
On FreeBSD, the GNU make program is actually called gmake .  In addition to compiling GutenMark, this will attempt to test the compilation by running GutenMark to produce sample HTML file (bldhb10.html) which it compares to an equivalent HTML file (bldhb10.txt.html) provided with the distribution.  (For versions prior to 20011205, this comparison will fail because GutenMark embeds the compilation date within the HTML markup.  However, only a single line of the HTML file, albeit a long one, will fail the comparison.  If more than one HTML lines fail to match, then the executable is not working properly.)


Other Stuff You Might Want

The function of GutenMark is merely to convert the Project Gutenberg etexts to marked-up HTML or LaTeX.  If that's all you want -- if you want to read the etext online, or to set up a web site that displays PG texts in HTML, or if you're fine with printing etexts from your browser, or if you want to use the HTML as a starting point for further markup -- then you're all set!

If, on the other hand, you are looking for an end-to-end solution that can produce attractive printable texts like this sample, then you need some better way of printing HTML than your browser can provide.  You could, of course, load the HTML into Microsoft Word or some other word processing program, and manipulate the document format manually.

The solution I would choose instead is to use a utility program that can convert HTML to Postscript printer language, or to PDF format.  Several such free utilities are available.

Description
Sample
page
Version
Configuration
file
8.5"×5.5" 9pt New Century Schoolbook font   page9schoolbook.pdf Nov. 18, 2001  half9schoolbook.rc
8.5"×5.5" 10pt Times Roman font   page10times.pdf Nov. 17, 2001  half10times.rc
8.5"×5.5" 10pt Bookman font   page10bookman.pdf Nov. 17, 2001  half10bookman.rc
8.5"×5.5" 12pt New Century Schoolbook font   page12schoolbook.pdf Nov. 25, 2001  half12schoolbook.rc
  • htmldoc is available for either for Win32 or in source-code form (for Linux systems), and has some very nice properties.  I personally find it a little buggy, but it's apparently under active development and can presumably only get better.  The main problem is that it is very bad at right justification (or at least, I haven't figured it out), and so you need to use ragged-right text.
  • Many other means of converting the HTML to alternate forms for printing or editing are also available.  While I have not used any of them myself, here are some ideas:

    ©2001 Ronald S. Burkey
    Last updated 12/28/01 by RSB.  Contact me .