#139
February 28th, 2006, 03:03 PM
connell
Newbie Floating Down The Mistic River
 
Join Date: Feb 2006
Location: Scotland
Posts: 1
Hi,

I'm new to this forum and joined because of this thread.

I thought about having Wikipedia on my iriver last year, and downloaded the Wikipedia XML dump file pages-articles.xml.bz2. I then re-imported this into a MySQL database using MediaWiki 1.5. Then, using the iPod Linux script with a few modifications, I started getting the HTML output from the copy of Wikipedia now on my computer. This script uses the *nix utility html2text to strip out all HTML tags (including links). I had to stop because my version of MySQL was too old and was causing problems. What I was going to do after that was write a shell script to sort the files into folders depending on the title, with a maximum depth of, say, 4-8 directories:
For Apples:
A\P\P\L\Apples.txt
For Bus:
B\U\S\Bus.txt
This would make it quite quick to navigate to an article in the file view, and it would be simple to write a plugin that follows the same folder scheme and provides a very simple search facility.
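Here is a rough sketch of the folder scheme above. I had a shell script in mind, but Python is easier to show. The function name article_path and the depth of 4 are just my assumptions for illustration, and it doesn't deal with titles containing characters that aren't allowed in file names (slashes, colons and so on):

```python
from pathlib import PureWindowsPath

MAX_DEPTH = 4  # assumed value; anything from 4 to 8 was the rough idea


def article_path(title: str, max_depth: int = MAX_DEPTH) -> PureWindowsPath:
    """Map an article title to a nested path with one folder per character."""
    # Only letters and digits become folder names, upper-cased,
    # and never more than max_depth of them.
    letters = [ch.upper() for ch in title if ch.isalnum()][:max_depth]
    return PureWindowsPath(*letters, f"{title}.txt")


print(article_path("Apples"))  # A\P\P\L\Apples.txt
print(article_path("Bus"))     # B\U\S\Bus.txt
```

Short titles like Bus simply end up in a shallower folder, which matches the examples above.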
An index could also be created to allow a more comprehensive search of titles.
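One way such a title index might look, as a minimal sketch: a word-to-titles lookup built once on the computer. The names build_index and search and the sample titles are made up for illustration, and a real plugin on the player would need something much lighter than this:

```python
from collections import defaultdict


def build_index(titles):
    """Map each lower-cased word to the sorted list of titles containing it."""
    index = defaultdict(set)
    for title in titles:
        for word in title.lower().split():
            index[word].add(title)
    return {word: sorted(found) for word, found in index.items()}


def search(index, query):
    """Return the titles that contain every word of the query."""
    words = query.lower().split()
    if not words:
        return []
    hits = set(index.get(words[0], []))
    for word in words[1:]:
        hits &= set(index.get(word, []))
    return sorted(hits)


titles = ["Apples", "Bus", "Apple pie", "Bus stop"]  # sample data only
index = build_index(titles)
print(search(index, "bus"))        # ['Bus', 'Bus stop']
print(search(index, "apple pie"))  # ['Apple pie']
```

This only matches whole words, so "apple" would not find "Apples"; prefix matching would be a natural next step.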

I think I'll try and update MySQL and try this again. I'll post if I get any further.

Thanks,
Connell
(inspired by Douglas Adams to do this as well)