Hi,
I need someone to parse the text from wikipedia and remove all wikipedia markup so i just have the basic article.
If you goto:
[login to view URL]:Export/train
you will see the article but surrounded by wikipedia markup language, i need this removing and just the basic text. I'm not bothered about images or tables.
I also need the headings converting from ==Freight trains== to <h1>Freight trains</h1>
this needs to work for any keyword passed to wikipedia
e.g.
[login to view URL]:Export/England
would work too.
I'd also like the script to detect redirects and bring up the redirected article.
e.g.
If you look at:
[login to view URL]:Export/UK
it redirects to:
[login to view URL]:Export/United_Kingdom
good luck!