[ < ] [ > ]   [ << ] [ Up ] [ >> ]         [Top] [Contents] [Index] [ ? ]

5. Extracting Links

This section is written assuming that you are using a standard HTML infostructure in a directory or on the World Wide Web

The first part of using link controller is to extract the links. When doing this, a pair of index files is built which list which URLs happen on which pages along with a file listing all of the URLs in the infostructure.

FIXME: compare and contrast multi-user configuration with single user

The first stage of the process is done by extract-links (2).

There are two modes for extract links directory and www. The key difference between them is that the latter actually downloads from a server so it is less efficient but will work in more circumstances and is more likely to represent your site as seen by users. This is assuming that all of your WWW pages are interconnected so it can find them.

FIXME : need to describe modes of operation of extract link

extract-links creates three files. The first two files (`*.cdb') are the index files for your infostructure and are located wherever you have configured them to by default they are called `link_on_page.cdb', `page_has_link.cdb'. The third file is the database file `links.db'. extract-links can also optionally create a text file which lists all of the urls in the infostructure, one per line.


[ << ] [ >> ]           [Top] [Contents] [Index] [ ? ]

This document was generated by Michael De La Rue on December, 30 2001 using texi2html