absetfan is a comprehensive web scraping tool for retrieval of information from the site fanfiction.net. “Fanfiction” is a storyform where fans of a particular “fandom,” or narrative from one of several media types (ex, books, movies, comics, etc), create written pieces using details from those original narratives. These details can be as specific or general as the fan ordains; the stories could involve the same setting as the original fandom, the same world conditions (ex, include magic), the same or similar characters, and any permutation of associated characteristics. Writers range from novice to highly skilled.

The authors of this package see fanfiction.net as a potentially rich and underutilized text data source; operating since 1998 with a community of over 10 million registered users, the site hosts a substantial volume of curious and slightly unorthodox information (link). Access to individual fandom story listings may offer insight to current usages of the storytelling platform, as well as shifting popularities of different narratives (through exploration of reviews, post numbers, follows, favorites, and so on). Working with the raw data presented within story chapters could facilitate practice in text mining, or reveal patterns in narrative construction. Tracking such patterns could offer analysis relevant to a greater cultural context, as narratives like 50 Shades of Gray originally got their beginnings from fan written work (in this case, a Twilight fanfiction).

Overall, absentfan may offer access to a plethora of user queried data excellent for use in exploring R base functions and text mining techniques.

For more on fanfiction, we recommend you look here (via).

Maintaining the Scraper

Update Repository of Story Tags

The absentfan package uses a pre-scraped list of href tags from the fanfiction.net media pages to get information about stories or the stories themselves. If the user feels that the fandom they are looking for is missing due to it being very recent (after 2018), they may use the updateTypeMedia function to update the overall list of fandoms available for scraping with this package.

typeMedia <- updateTypeMedia()