Difference between revisions of "Wiki Backups"

From InstallGentoo Wiki

Revision as of 01:00, 15 May 2015

MediaWiki Based Sites

You can create your own viewable, offline backup of most MediaWiki wikis (i.e. anything that looks like Wikipedia). This is handy if you're away from your Wi-Fi connection, are too cautious to look up certain search terms online, or have friends in places where wikis are blocked.

It's also pretty cool to have your own copy of Wikipedia.

Usable Offline MediaWiki Backups

To create an accessible, offline wiki backup you need two things:

  1. A dump of the wiki, often compressed into a single file (sources below).
  2. A program to access and search the compressed wiki file with.

XOWA is a cross-platform, AGPLv3-licensed program for viewing wiki dumps. It's simple to use and pretty easy to set up. Wiki dumps need to be imported before use; on an i7 running at 3.4 GHz, importing runs at about 1 MB/sec. That works out to about a minute for WikiVoyage and half a day for the full English Wikipedia.

Creating Usable MediaWiki Backups

MediaWiki-based wikis (like this one) can be backed up to your own computer:

Wikimedia Backup Sources

Wikimedia creates backups monthly, available from Wikimedia Dumps.

  • These are text-only dumps.
  • They back up all of their wikis in all languages.
  • There are several versions of each wiki to download. You likely want the [wikiname]-[date]-pages-articles.xml.bz2 file, which they kindly highlight in bold for you. This version lacks the full edit history and details of the non-text items (pics, vids etc.). If you want the full edit history, other versions include it.
  • A full, compressed text backup of the English Wikipedia is about 10 GB.
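
The file naming pattern above can be turned into a download URL. This is only a rough sketch: it assumes the dumps are served from dumps.wikimedia.org under a [wikiname]/[date]/ directory, and the helper name dump_url plus the example wiki and date are made up for illustration, not taken from this page.

```shell
# dump_url WIKI DATE: print the URL of a "pages-articles" dump.
# Assumption: dumps live under https://dumps.wikimedia.org/<wiki>/<date>/
dump_url() (
    wiki=$1   # database name of the wiki, e.g. enwiki
    date=$2   # dump date as YYYYMMDD (must be a date that actually exists)
    printf 'https://dumps.wikimedia.org/%s/%s/%s-%s-pages-articles.xml.bz2\n' \
        "$wiki" "$date" "$wiki" "$date"
)

# Usage (several GB for a big wiki; wget -c resumes an interrupted transfer):
# wget -c "$(dump_url enwiki 20150501)"
```

Check the dump index page for a date that is actually listed before running the download.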

BurnBit is a useful site for downloading large Wikimedia dumps. It can create a torrent from any URL and also acts as a tracker. Paste the URL of your Wikimedia backup file into it to create (or access) the torrent for it, and you'll download at full speed.

This Wiki's Backup

Install Gentoo wiki viewed offline in XOWA

A backup of this wiki is now available at https://wiki-dump.installgentoo.com/. It includes all images, pages, and a database backup (excluding user account data), and is suited for offline viewing with XOWA. The backup is generated nightly.

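Alternatively, you can scrape the wiki yourself using WikiTeam's MediaWiki scripts (https://github.com/WikiTeam/wikiteam). Point --api at the target wiki's api.php; the URL below is a placeholder:
$ sudo apt-get install python2 python-kitchen python-requests git p7zip
$ git clone https://github.com/WikiTeam/wikiteam
$ cd wikiteam
$ python2 dumpgenerator.py --api=http://WIKI_HOST/api.php --xml --images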

WikiLeaks Backups

WikiLeaks released three torrent files in August 2013 named "insurance":

  • WikiLeaks insurance 20130815 - A (3.6 GB)
    • MD5: a243f323612b86155e4c44c7efa38d90
    • SHA1: a3e666f7f03001ce1b6556133b5217ab0d668463
    • SHA256: 6688fffa9b39320e11b941f0004a3a76d49c7fb52434dab4d7d881dc2a2d7e02
    • SHA512: c865d260e96a654540b4ef34be4242e5105d5260059436779028f1db0324f046b11a83098d561aa855ad7cc823e9e72c59fe59e92b246889985054edfaea1ef2
    • KAT magnet link
    • TPB magnet link
  • WikiLeaks insurance 20130815 - B (49 GB)
    • MD5: 0a7f57171f4ba49e42d3cb9cd602ec72
    • SHA1: 7e56d7a720ba6e9b00bbb66e6f64bd46e9285361
    • SHA256: 3dcf2dda8fb24559935919fab9e5d7906c3b28476ffa0c5bb9c1d30fcb56e7a4
    • SHA512: 37f3c44c6a8b51d6c7da84386ecc9b2ef4b9d1ca6df44ebee606742772be14c53811e883bcc0e8c659c7a4fe3ecf7b170585bbdf0a0c5b305a51162ce49147e5
    • KAT magnet link
    • TPB magnet link
  • WikiLeaks insurance 20130815 - C (349 GB)
    • MD5: c735e3f7c6d0ae2cad131b5539d303b0
    • SHA1: e74fd2fdd5e3bc6a0cb26813746912394385422e
    • SHA256: 913a6ff8eca2b20d9d2aab594186346b6089c0fb9db12f64413643a8acadcfe3
    • SHA512: e2385bf423e7b10aae121a2cf6467d996d32814eefc70c0fe08daa66096119a202d108e199a26ab6f1cbba0c6b1bfc03e9c670b853cc346dd061ce6b49a6f819
    • KAT magnet link
    • TPB magnet link
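
Once a torrent has finished, the listed hashes let you confirm the download is intact. Here is a small sketch using sha256sum from GNU coreutils; the helper name verify_sha256 and the file path in the usage line are hypothetical, since this page does not give the actual file names.

```shell
# verify_sha256 FILE EXPECTED_HEX: succeed only if FILE's SHA-256 matches.
verify_sha256() (
    file=$1
    expected=$2
    # sha256sum prints "<hex digest>  <filename>"; keep only the digest.
    actual=$(sha256sum "$file" | cut -d ' ' -f 1)
    [ "$actual" = "$expected" ]
)

# Usage (path is a placeholder; the hash is the SHA256 listed for file A):
# verify_sha256 /path/to/insurance-A \
#     6688fffa9b39320e11b941f0004a3a76d49c7fb52434dab4d7d881dc2a2d7e02 \
#     && echo "OK" || echo "MISMATCH"
```

The same approach should work for the other listed digests by swapping in md5sum, sha1sum or sha512sum.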

These are all encrypted and no password has been released. You can find the torrents here:

If the password is ever released ("Whatever happens, even if there's video; it was murder"), keep in mind that the files are encrypted with OpenSSL file encryption, so OpenSSL is the tool you would use to decrypt them.
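
For reference, decrypting a file with OpenSSL generally looks like the sketch below. The cipher (AES-256-CBC) is an assumption, as this page does not say which one was used, and the helper name decrypt_file and the paths in the usage line are hypothetical.

```shell
# decrypt_file SRC DST [extra openssl options...]
# Assumption: AES-256-CBC; this page does not state the cipher.
decrypt_file() (
    src=$1
    dst=$2
    shift 2
    # Without a -pass option, openssl prompts for the password interactively.
    openssl enc -d -aes-256-cbc -in "$src" -out "$dst" "$@"
)

# Usage (paths are placeholders):
# decrypt_file /path/to/insurance-A /path/to/insurance-A.decrypted
```

OpenSSL 1.1.0 changed the default key-derivation digest for enc from MD5 to SHA-256, so files encrypted with an older release may need -md md5 passed as an extra option.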