NAME

txt2graph - generates a dot-file from text-dependencies


DOWNLOAD

See http://andreas-romeyke.de/txt2graph/. The changelog is at http://andreas-romeyke.de/txt2graph/CHANGES.

The latest tarball is at http://andreas-romeyke.de/txt2graph/txt2graph1.2.3.tar.gz


SYNOPSIS

txt2graph visualizes the structure and dependencies of a text read from stdin and writes a dot-file for graphviz to stdout.


DEPENDENCIES


DESCRIPTION

What it does

txt2graph visualizes the structure and dependencies of a text. I wrote it because I needed to analyze and understand how my own text compressor should work to give good results. With txt2graph it is possible to find the weights in a document. The tool is also useful for comparing documents, much as an n-gram tool would. Using the filterfile and frequency options, you get small "topic" maps of documents.

If you find another usage, please contact me.

How it works

txt2graph reads a text document from stdin, removes all non-alphabetic characters and builds an array (list) of words. It then converts German umlauts, because graphviz can only handle plain ASCII in node descriptions, and outputs a dot-file for a directed or an undirected graph.
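
The steps above can be sketched roughly as follows. This is an illustration in Python, not the actual txt2graph code: the function names, the umlaut table, and the choice to link each word to the word that follows it (with the number of occurrences as edge label) are all assumptions made for the example.

```python
from collections import Counter

# Assumed transliteration table; txt2graph's real mapping may differ.
UMLAUTS = str.maketrans({
    "ä": "ae", "ö": "oe", "ü": "ue",
    "Ä": "Ae", "Ö": "Oe", "Ü": "Ue", "ß": "ss",
})


def words_from_text(text):
    """Drop non-alphabetic characters and return the list of words."""
    # Every character that is not a letter becomes a word separator.
    cleaned = "".join(ch if ch.isalpha() else " " for ch in text)
    # Transliterate German umlauts; other non-ASCII letters pass through unchanged.
    return [word.translate(UMLAUTS) for word in cleaned.split()]


def to_dot(words, directed=True):
    """Return dot source with one edge per pair of neighbouring words (assumed model)."""
    pairs = zip(words, words[1:])
    if not directed:
        # In an undirected graph a--b and b--a are the same edge.
        pairs = (tuple(sorted(pair)) for pair in pairs)
    counts = Counter(pairs)
    keyword, arrow = ("digraph", "->") if directed else ("graph", "--")
    lines = [f"{keyword} txt {{"]
    for (left, right), count in counts.items():
        lines.append(f'  "{left}" {arrow} "{right}" [label="{count}"];')
    lines.append("}")
    return "\n".join(lines)
```

Calling to_dot(words_from_text(text)) on a string yields dot source that can be passed to dot or neato. Counting repeated word pairs is one possible way to express the "weights" mentioned above; the real tool may compute them differently.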

Warning

The generated dot-file is typically 4 to 5 times the size of the original document. This also means that dot or neato can take a very long time to process it.


AUTHOR and COPYRIGHT

txt2graph (c)2001-2003 by Andreas Romeyke (andreas.romeyke@web.de)


LICENSE

txt2graph is distributed under the terms of the GNU General Public License. You will find a copy in the file COPYING in this distribution or at http://www.gnu.org.

If you want a special license for commercial use, please consider it carefully first, then contact me by signed email (signed with PGP or GnuPG!) with the subject "txt2graph license".


THANKS

Special thanks go to my girlfriend Maren for her patience, to Matthias Richter for his ideas, to Leon Brocard for his GraphViz module and to Derek Jones for his bug reports. Last but not least, thanks go to the team behind the GraphViz tools.


HOW TO USE IT


BUGS

The following bugs are known:

Of course, there are certainly further bugs that are not yet known. If you find one, please do not hesitate to contact me at andreas.romeyke@web.de


TODO

See the separate file http://andreas-romeyke.de/txt2graph/TODO. If you have suggestions or patches (preferred), please contact me as well.


API Description

... coming soon ...