Go to file
2024-08-11 02:46:28 -05:00
.gitignore fix: add .gitignore 2023-09-16 22:29:04 -05:00
afw.py fix make_archive 2024-08-11 02:46:28 -05:00
Containerfile refactor: combine RUN commands 2023-09-17 21:53:52 -05:00
entrypoint.sh Refactor 2024-07-27 11:39:12 -05:00
LICENSE Initial commit or whatever 2023-09-14 20:36:56 -05:00
README.org Refactor 2024-07-18 02:40:18 -05:00
requirements.txt Refactor 2024-07-18 02:40:18 -05:00

archive-fandom-wiki

This program archives the content of fandom wikis. It doesn't scrape from the fandom.com wiki sites directly; rather, it uses my BreezeWiki instance to avoid downloading unnecessary ads, images, and other junk.

Each resulting archive is self-contained, meaning one can extract the contents and browse the wiki snapshot locally (offline). The URLs for CSS, images, and links in each page are replaced by the relative file:/// URLs for their corresponding pages on the local filesystem.

Installation

Make sure Python and Pip are installed. Then run:

git clone https://codeberg.org/hyperreal/archive-fandom-wiki.git
cd archive-fandom-wiki
python -m venv venv
source venv/bin/activate
pip install -r requirements.txt

If not using venv:

pip install --user -r requirements.txt
ln -sf "$(pwd)/afw.py" ~/.local/bin/afw

Usage

One may specify the BreezeWiki instance URL, or the default value (my BreezeWiki instance URL) will be used.

afw dishonored https://breezewiki.instance.url
afw dishonored

Podman/Docker

There is also a Containerfile, also known as a Dockerfile.

git clone https://codeberg.org/hyperreal/archive-fandom-wiki
cd archive-fandom-wiki
podman build -t localhost/archive-fandom-wiki:latest .

To run the container image:

podman run --name archive-fandom-wiki --rm -v "${HOME}/archives:/output:Z" localhost/archive-fandom-wiki dishonored