MKNMZ

Section: Namazu Project (1)
Updated: January 2006
Index

NAME

mknmz - an indexer of Namazu

SYNOPSIS

mknmz [options] <target>...

DESCRIPTION

mknmz 2.0.15, an indexer of Namazu.

Target files:

-a, --all: target all files.
-t, --media-type=MTYPE: set the media type for all target files to MTYPE.
-h, --mailnews: same as --media-type='message/rfc822'
--mhonarc: same as --media-type='text/html; x-type=mhonarc'
-F, --target-list=FILE: load FILE which contains a list of target files.
--allow=PATTERN: set PATTERN for file names which should be allowed.
--deny=PATTERN: set PATTERN for file names which should be denied.
--exclude=PATTERN: set PATTERN for pathnames which should be excluded.
-e, --robots: exclude HTML files containing <meta name="ROBOTS" content="NOINDEX">
-M, --meta: handle HTML meta tags for field-specified search.
-r, --replace=CODE: set CODE for replacing URI.
--html-split: split an HTML file with <a name="..."> anchors.
--mtime=NUM: limit by mtime just like find(1)'s -mtime option. e.g., -50 for recent 50 days, +50 for older than 50.

Morphological Analysis:

-b, --use-mecab: use MeCab for analyzing Japanese.
-c, --use-chasen: use ChaSen for analyzing Japanese.
-k, --use-kakasi: use KAKASI for analyzing Japanese.
-m, --use-chasen-noun: use ChaSen for extracting only nouns.
-L, --indexing-lang=LANG index with language specific processing.

Text Operations:

-E, --no-edge-symbol: remove symbols on edge of word.
-G, --no-okurigana: remove Okurigana in word.
-H, --no-hiragana: ignore words consist of Hiragana only.
-K, --no-symbol: remove symbols.
--decode-base64: decode base64 bodies within multipart entities.

Summarization:

-U, --no-encode-uri: do not encode URI.
-x, --no-heading-summary do not make summary with HTML's headings.

Index Construction:

--update=INDEX: set INDEX for updating.
-z, --check-filesize: detect file size changed.
-Y, --no-delete: do not detect removed documents.
-Z, --no-update: do not detect update and deleted documents.

Miscellaneous:

-s, --checkpoint: turn on the checkpoint mechanism.
-C, --show-config: show the current configuration.
-f, --config=FILE: use FILE as a config file.
-I, --include=FILE: include your customization FILE.
-O, --output-dir=DIR: set DIR to output the index.
-T, --template-dir=DIR: set DIR having NMZ.{head,foot,body}.*.
-q, --quiet: suppress status messages during execution.
-v, --version: show the version of namazu and exit.
-V, --verbose: be verbose.
-d, --debug: be debug mode.
--help: show this help and exit.
--norc: do not read the personal initialization files.
--: Terminate option list.

REPORTING BUGS

Report bugs to <http://www.namazu.org/trac-namazu/trac.cgi> or <bug-namazu@namazu.org>.

COPYRIGHT

This is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

Index

NAME

SYNOPSIS

DESCRIPTION

Target files:
Morphological Analysis:
Text Operations:
Summarization:
Index Construction:
Miscellaneous:

REPORTING BUGS

COPYRIGHT

This document was created by man2html, using the manual pages.
Time: 20:08:01 GMT, January 30, 2006