Technical Information
To process the String Cheese setlists from Friends of Cheese, I first run them
through the non-graphical browser lynx, using the -dump option. This strips the
HTML markup and inserts line breaks in appropriate places, which is critical to
parsing the setlists correctly later on.
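
As a rough illustration of the lynx step, the dump could be driven from Python as follows. This is only a sketch, not the actual Cheesebase code: the function names and the file name in the usage note are hypothetical, and the only lynx option used is -dump, as described above.

```python
import subprocess


def build_lynx_command(source):
    """Return the argument list that dumps one page as plain text.

    `source` may be a URL or a local HTML file; both are hypothetical here.
    """
    return ["lynx", "-dump", source]


def dump_page(source):
    """Run lynx and return the rendered page as a list of text lines.

    Requires lynx to be installed; raises CalledProcessError on failure.
    """
    result = subprocess.run(
        build_lynx_command(source),
        capture_output=True,
        text=True,
        check=True,
    )
    # Keep the line breaks lynx inserted, since later parsing relies on them.
    return result.stdout.splitlines()
```

Calling dump_page("setlist.html") on a saved page would give one string per rendered line, ready to hand to a line-oriented parser.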
All of the post-processing of the setlists is done by a set of Python
programs, totaling about 700-800 lines. The song list and show list are
generated automatically, along with all of the individual set lists and the
"every time played" pages.
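
To give a feel for how a song list and "every time played" pages can be derived from parsed shows, here is a minimal sketch. The data layout (a list of dicts with "date", "venue" and "sets" keys) and every name and value in it are assumptions made for illustration; the real programs may be organized quite differently.

```python
from collections import defaultdict


def build_song_index(shows):
    """Map each song title to the sorted list of dates it was played."""
    played = defaultdict(set)
    for show in shows:
        for set_songs in show["sets"]:
            for title in set_songs:
                # A set avoids counting a song twice for the same show.
                played[title].add(show["date"])
    return {title: sorted(dates) for title, dates in played.items()}


# Made-up placeholder data, not real setlists.
shows = [
    {"date": "2000-01-02", "venue": "Venue B",
     "sets": [["Song X", "Song Y"], ["Song X"]]},
    {"date": "2000-01-01", "venue": "Venue A",
     "sets": [["Song Y"]]},
]

song_index = build_song_index(shows)        # data for "every time played"
song_list = sorted(song_index)              # data for the song list
show_list = sorted(s["date"] for s in shows)  # data for the show list
```

Each entry in song_index holds what one "every time played" page needs, while song_list and show_list hold the contents of the two top-level listings.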
The search engine is also powered by a Python
program, run as a CGI program on an Apache web server. Three separate
Berkeley DB hashes, one each for venues, songs and notes, are generated at
the same time as the main pages, and are accessed by the search engine for
queries.
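
The sketch below shows one way a search of this kind can work: one on-disk hash per field, written ahead of time and read at query time. It is an assumption-laden illustration rather than the Cheesebase code. Python's standard dbm module stands in for Berkeley DB, and the parameter names "field" and "q", the file names, and the sample data are all hypothetical.

```python
import dbm
import os
import tempfile
from urllib.parse import parse_qs

FIELDS = ("venues", "songs", "notes")  # one hash per searchable field


def build_index(path, entries):
    """Write a term -> show-dates mapping to a fresh on-disk hash."""
    with dbm.open(path, "n") as db:
        for term, dates in entries.items():
            # dbm stores bytes, so encode keys and newline-joined values.
            db[term.lower().encode("utf-8")] = "\n".join(dates).encode("utf-8")


def lookup(path, term):
    """Return the show dates stored under `term`, or an empty list."""
    with dbm.open(path, "r") as db:
        raw = db.get(term.lower().encode("utf-8"))
    return raw.decode("utf-8").split("\n") if raw else []


def search(index_dir, query_string):
    """Answer a CGI-style query string such as 'field=songs&q=song+x'."""
    params = parse_qs(query_string)
    field = params.get("field", ["songs"])[0]
    term = params.get("q", [""])[0]
    if field not in FIELDS or not term:
        return []
    return lookup(os.path.join(index_dir, field), term)


# Demo with made-up placeholder data in a throwaway directory.
with tempfile.TemporaryDirectory() as index_dir:
    build_index(os.path.join(index_dir, "songs"), {"Song X": ["2000-01-02"]})
    build_index(os.path.join(index_dir, "venues"), {"Venue A": ["2000-01-01"]})
    build_index(os.path.join(index_dir, "notes"), {})
    hits = search(index_dir, "field=songs&q=song+x")
    misses = search(index_dir, "field=songs&q=unknown")
```

In a script run under CGI, the query string would come from the QUERY_STRING environment variable set by the web server, and the results would be printed as an HTML page after a Content-Type header. Building the hashes in the same pass as the static pages keeps the search results consistent with what the pages show.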
My server (edithst.com) is a generic Intel Pentium 4 computer running version
2.4.9 of the Linux kernel, connected to the Internet over a DSL line.
If you'd like more information about Cheesebase, or if you'd like to see the
Python programs I use to produce it, please contact me.