This Malazan Book of the Fallen Compendium is a project created by Atharva Shah for educational purposes. It is an unofficial book compiled from the content written by the talented individuals at Tor, namely Bill Capossere and Amanda Rutter. The text content is released under Creative Commons BY-SA.

Please note that this project is not affiliated with Tor or Steven Erikson, the author of the Malazan Book of the Fallen series. All trademarks and registered trademarks mentioned in this compendium belong to their respective company owners.

GITHUB : https://github.com/HighnessAtharva/Malazan-Compendium

DEMO : https://highnessatharva.github.io/Malazan-Compendium/

Project Summary

The purpose of this project is to provide a comprehensive compilation of chapter summaries from each book of the Malazan Book of the Fallen series by Steven Erikson. This compendium serves as a valuable resource for readers who wish to revisit or explore the series, offering a concise overview of the plot and events in each chapter.

The compendium was created by web scraping the Tor site, extracting nearly 150+ links containing the chapter summaries. The scraped data was then migrated to Markdown format and converted to PDF/HTML using pandoc. The converted HTML files are hosted on GitHub Pages for easy access and navigation.

Project Tech Stack

The following technologies and tools were utilized in the development of this project:

  • Web scraping using Beautiful Soup, Selenium, and Scrapy
  • Markdown for formatting the extracted data
  • Pandoc for converting Markdown to PDF/HTML
  • GitHub Pages for hosting the converted HTML files

Learnings

Throughout the course of this project, several valuable lessons were gained:

  • Proficiency in web scraping techniques using libraries like Beautiful Soup, Selenium, and Scrapy
  • Familiarity with Markdown syntax for text formatting
  • Understanding the conversion process from Markdown to PDF/HTML using pandoc
  • Installating Custom Markdown Themes for Exporting PDF/HTML
  • Experience in hosting HTML files on GitHub Pages for easy accessibility

This project provided an opportunity to explore and apply various technologies in a practical context, enhancing skills related to web scraping, data manipulation, and document conversion.


Please note that this project is a personal endeavor and should be used solely for educational purposes. The intention is to provide a convenient resource for fans of the Malazan Book of the Fallen series. Enjoy exploring the chapter summaries and dive deeper into the intricate world created by Steven Erikson.