Aug. 2nd, 2011

[personal profile] kerravonsen
Those of you here who are fannish as well as geeky might be interested in this.

I have written my own fan-fiction downloader in Perl, which can be installed from CPAN as "WWW::FetchStory". There are probably Linux-isms in the code. (frown) For example, it uses the "wget" program to do the actual downloading.

But I would love other people to use the script! It has plugins (which I am calling "fetchers") for various fiction sites. Each fetcher knows how to download multi-chapter fics from its site, so you only have to give the table-of-contents URL for the fic and it will figure out the rest. Depending on the particular fetcher, it will get not just the title and author, but also the summary, the categories and the characters.
It also has an option to create an EPUB file rather than HTML files.
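
To illustrate the fetcher idea for anyone thinking of writing one, here is a minimal sketch in Python rather than Perl. It is not the module's actual code: the class names, the `allows` and `chapter_urls` methods, and the `archive.example.org` host are all made up for the example. It only shows the general shape of picking a site-specific plugin from a table-of-contents URL and falling back to a generic one.

```python
from urllib.parse import urlparse


class Fetcher:
    """Base class: one subclass per fiction site (hypothetical interface)."""

    hosts = ()  # hostnames this fetcher claims to understand

    def allows(self, url):
        # True when the URL's host is one of ours, or a subdomain of one.
        host = urlparse(url).hostname or ""
        return any(host == h or host.endswith("." + h) for h in self.hosts)

    def chapter_urls(self, toc_url):
        raise NotImplementedError


class ExampleArchiveFetcher(Fetcher):
    """Fetcher for an imaginary archive; the URL layout is an assumption."""

    hosts = ("archive.example.org",)

    def chapter_urls(self, toc_url):
        # A real fetcher would download and parse the table of contents here.
        return [f"{toc_url.rstrip('/')}/chapters/{n}" for n in (1, 2, 3)]


class DefaultFetcher(Fetcher):
    """Fallback: treat the given URL as a single-page story."""

    def allows(self, url):
        return True

    def chapter_urls(self, toc_url):
        return [toc_url]


# Order matters: site-specific fetchers first, the fallback last.
FETCHERS = [ExampleArchiveFetcher(), DefaultFetcher()]


def pick_fetcher(url):
    """Return the first fetcher that accepts the URL."""
    for fetcher in FETCHERS:
        if fetcher.allows(url):
            return fetcher
    return None


if __name__ == "__main__":
    toc = "http://archive.example.org/works/42"
    fetcher = pick_fetcher(toc)
    print(type(fetcher).__name__, fetcher.chapter_urls(toc))
```

Keeping the fallback last means an unrecognised site still gets downloaded as a single page instead of failing outright.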

Currently, I have written fetchers for: ( list of fanfic archives )
But every now and then, those sites change their code and the fetcher for that site breaks. (frown)

Also, for a number of those archives, you must be logged in if you want to download "adult" rated fic. The solution I devised for that is rather clumsy (and Linux-centric): it looks for a "cookies.txt" file in your home directory, which you need to have exported from your browser after logging in to the site.
If someone has a better solution, I would love to hear from you.
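
For readers who have not met the cookies.txt trick before, here is a rough sketch of the idea in Python (again, not the module's Perl code). It assumes the cookies were exported in the Netscape format that wget's `--load-cookies` option reads; the function name `build_wget_command` and the example URL are invented for illustration.

```python
from pathlib import Path


def build_wget_command(url, cookie_file=None):
    """Build a wget argument list, adding the cookie file only if it exists.

    cookie_file defaults to ~/cookies.txt, mirroring the approach in the post.
    """
    if cookie_file is None:
        cookie_file = Path.home() / "cookies.txt"

    # --quiet suppresses progress output; --output-document=- writes the page to stdout.
    cmd = ["wget", "--quiet", "--output-document=-"]

    # Only pass the cookies when the file is really there, so anonymous
    # downloads still work for people who never exported any.
    if Path(cookie_file).is_file():
        cmd.append(f"--load-cookies={cookie_file}")

    cmd.append(url)
    return cmd


if __name__ == "__main__":
    # Prints the command instead of running it; pass the list to
    # subprocess.run(...) to perform the download.
    print(build_wget_command("http://archive.example.org/works/42"))
```

Returning the arguments as a list rather than a single string avoids shell-quoting problems when the URL contains characters such as `&` or `?`.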

For the more geeky among you, the source is in my git repository at https://github.com/rubykat/WWW-FetchStory
I would LOVE people to contribute to it, whether that be fixing bugs, fixing documentation, improving fetchers, or writing new fetchers.
