Note: This is my personal, mostly rewritten and thus incompatible fork of emoose's 4chan-archiver. I've stopped using and maintaining it in mid 2014, but it's still working mostly fine as of 2014-12-27 2016-06-13.
Features I've removed for the sake of simplicity:
- Username/password authentication. I recommend running 4chan-archiver in a directory that's password-protected using
.htaccessor similar (which is more secure anyway). - Automatic updates.
Some of the features I've added:
- Different Design: not sure if it's a huge improvement over emoose's, but it's a bit more colorful.
- ZIP compression: you can create a ZIP archive of a thread at any time, it will be automatically updated once the thread 404's (or created if none exists, which you can deactivate in
config.php). - Marked Threads: will show up at the top of the thread list
- Added Column: see when each thread was added.
- Posts Column: for each thread, the number of posts and images will be displayed.
- Folder Size Stats.
- More granular control over how often different kinds of threads are checked. Here's an example crontab:
48 */4 * * * php -f /path/to/cron.php
*/8 * * * * php -f /path/to/cron_fast.php
*/32 * * * * php -f /path/to/cron_marked.php
*/16 * * * * php -f /path/to/cron_size.php
Known bugs:
- SQL injection and related exploits are very possible. I recommend running 4chan-archiver in a directory that's password-protected using
.htaccessor similar. - If your browser window is narrower than 1440px, things might look out of place. Try decreasing the width of
table.threads input.descinstyle.css. - If adding a thread fails, make sure to remove the appended subject/comment snippets.
- Entries of the
Poststable aren't always properly deleted.
Below is the original README.md:
This archiver is outdated and won't be maintained anymore, I've made a new version called "chan archiver" and uploaded it to http://github.com/emoose/chan-archiver/, I'd suggest you either use that or one of the forks available.
GNU public license 3 blah blah blah, can't be bothered to add the text and stuff in here. If you use/modify this just give credit to the github.
These small scripts let you create your own little 4chan archive, without needing to use crappy advert ridden websites! (or overly worked on perl scripts, this is 4 hours work)
- Fully parse and download any thread
- Very small overhead
- Just over 300 lines of code!
- Simple login system (see config.php)
- PHP 4+
- MySQL
- Server that supports cronjobs (or some other kind of scheduling device)
- Import chanarchive.sql into some database
- Setup config.php with your paths and mysql info
- Add a cronjob to /usr/bin/php -f /path/to/cron.php (might not be /usr/bin/php, check with your server admin)
Have fun! and if you are updating MAKE SURE YOU DELETE VERSION.TXT!
Any bugs? Post on the github!