Photo by Jason Goodman on Unsplash

Predicting content popularity in fanfiction communities

Clara Pont, 2022-05-17

In this post, Eurecat’s Computational Social Science team presents a model for predicting the popularity of content in fanfiction communities, developed as part of the work carried out in the Möbius project.

From customer intelligence to prosumer intelligence

One of the aims of the Möbius project is to guide the publishing sector in dealing with the emerging prosumer paradigm. Current publishing practices are still mostly based on a vision of the consumer as a passive actor who will simply buy or not buy the product, and rely on more or less traditional marketing and recommendation approaches to maximize sales. The potential of prosumers as co-creators of content, involved in all the steps of the process, is not fully taken into account, and the enormous wealth of content and interactions generated by these online communities remains untapped.

Taking advantage of the wealth of data created by prosumers, in the form of original content, reviews, feedback and interactions, implies being able to mine this data, make sense of it and extract actionable knowledge. To this end, the Eurecat team developed computational methods to analyse content and interactions in prosumer communities, based on established platforms where it is possible to access large amounts of data.  

One need that emerged from the conversations and focus groups with publishers is predicting content popularity, in order to detect trends and identify content with publishing potential.

Data scraping from a fanfiction platform: Archive of Our Own

To address this issue, the Eurecat team developed a model for predicting the popularity of works in fanfiction communities and applied it to the Archive of Our Own platform (AO3), an open-source fanfiction platform developed and maintained by fans, where users can publish works and review each other’s works. The website is one of the main references worldwide for fanfiction and, as reported on its home page, it currently hosts almost 9 million works created by its users, organized in over 40 thousand fandoms, with over 4 million registered users.

Source: Archive of Our Own

The dataset scraped from AO3 includes the complete interaction data for works from seven communities: Marvel, Harry Potter, Sherlock Holmes, Lord of the Rings, Percy Jackson, Twilight, and Warriors. For each of these communities, all the comments, replies, bookmarks and kudos (similar to likes) were retrieved. 

Model for predicting the popularity of content

Among other analyses performed on this data, the Eurecat team developed a model to predict works that will become popular in the near future, based on the previous history and on current growth speed. 

The popularity of a work is defined as the number of distinct users who have written at least one comment on it. This is a more reliable measure than the overall number of comments, which can be inflated by the activity of just a few very active users. The aim is to identify works that will be in the top 1% according to this measure after 30 or 60 days.
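The popularity measure described above can be sketched in a few lines of Python. This is only an illustration: the `(work_id, user_id)` input format, the function names and the tie handling at the cut-off are assumptions, not details taken from the Eurecat implementation.

```python
from collections import defaultdict
import math


def popularity_by_work(comments):
    """Map each work id to its number of distinct commenting users.

    `comments` is an iterable of (work_id, user_id) pairs, one per comment
    (a hypothetical input format chosen for this sketch).
    """
    commenters = defaultdict(set)
    for work_id, user_id in comments:
        commenters[work_id].add(user_id)
    return {work_id: len(users) for work_id, users in commenters.items()}


def top_fraction(popularity, fraction=0.01):
    """Return the ids of the works in the top `fraction` by popularity.

    Assumption: at least one work is always returned, and ties at the
    cut-off are broken by sort order.
    """
    k = max(1, math.ceil(len(popularity) * fraction))
    ranked = sorted(popularity, key=popularity.get, reverse=True)
    return set(ranked[:k])


# Toy data: work_a has three comments from one user,
# work_b has three comments from three different users.
comments = [
    ("work_a", "u1"), ("work_a", "u1"), ("work_a", "u1"),
    ("work_b", "u1"), ("work_b", "u2"), ("work_b", "u3"),
]
popularity = popularity_by_work(comments)
```

Both toy works have the same number of comments, but `work_b` scores higher because its comments come from three distinct users, which is the behaviour the definition is meant to capture.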

The model uses logistic regression to predict whether a work will become popular, based on the total feedback acquired up to the current time and on the variation in feedback over the most recent days. The predictions are good, with a precision of 0.79, recall of 0.90, and F1 score of 0.84.
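To make the approach more concrete, here is a minimal, standard-library-only sketch of a logistic regression classifier with two features per work: total feedback so far and feedback gained recently. The post does not specify the exact features, their scaling, or the training procedure, so the log scaling, the gradient-descent settings and the toy rows below are all assumptions made for illustration; the numbers are invented and are not AO3 data.

```python
import math


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(X, y, lr=0.15, epochs=5000):
    """Fit weights and bias by batch gradient descent on the mean log-loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * gj / n for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(w, b, x, threshold=0.5):
    """Return True if the predicted probability reaches the threshold."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)) + b) >= threshold


def f1_score(precision, recall):
    """Harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


def precision_recall_f1(y_true, y_pred):
    tp = sum(1 for t, p in zip(y_true, y_pred) if t and p)
    fp = sum(1 for t, p in zip(y_true, y_pred) if not t and p)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t and not p)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall, f1_score(precision, recall)


# Invented toy rows: (total feedback so far, feedback gained recently, label).
# Label 1 means the work later reached the top 1%.
rows = [
    (5, 0, 0), (12, 1, 0), (40, 2, 0), (8, 1, 0),
    (150, 30, 1), (90, 25, 1), (300, 60, 1),
]
# Assumed preprocessing: log-scale the counts to tame their heavy tails.
X = [[math.log1p(total), math.log1p(recent)] for total, recent, _ in rows]
y = [label for _, _, label in rows]

w, b = fit_logistic(X, y)
predictions = [predict(w, b, x) for x in X]
```

On real data the model would be evaluated on works held out from training rather than on the training rows themselves. The `f1_score` helper also shows how the reported figures relate to each other: a precision of 0.79 and a recall of 0.90 give an F1 score that rounds to 0.84.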

The model was then adapted to study tags instead of works, considering the popularity of a tag as derived from the (normalized) popularity of the works to which the tag is assigned. With tags, the scores obtained are higher: precision of 0.85, recall of 0.91, and F1 score of 0.87.
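One possible way to derive tag popularity from work popularity is sketched below. The post does not say how the normalization or the aggregation is done, so dividing by the maximum and averaging over works are assumptions, as are the function names and the toy tags.

```python
from collections import defaultdict


def normalize(popularity):
    """Scale work popularity to [0, 1] by dividing by the maximum.

    This normalization scheme is an assumption made for the sketch.
    """
    top = max(popularity.values(), default=0)
    if top == 0:
        return {work_id: 0.0 for work_id in popularity}
    return {work_id: p / top for work_id, p in popularity.items()}


def tag_popularity(work_popularity, work_tags):
    """Average the normalized popularity of the works carrying each tag."""
    norm = normalize(work_popularity)
    scores = defaultdict(list)
    for work_id, tags in work_tags.items():
        for tag in tags:
            scores[tag].append(norm.get(work_id, 0.0))
    return {tag: sum(values) / len(values) for tag, values in scores.items()}


# Toy data, invented for illustration.
work_popularity = {"work_a": 10, "work_b": 40, "work_c": 20}
work_tags = {
    "work_a": ["fluff"],
    "work_b": ["angst", "fluff"],
    "work_c": ["angst"],
}
tags = tag_popularity(work_popularity, work_tags)
```

Once each tag has a popularity score over time, the same kind of classifier used for works can be applied to tags.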

Conclusion and next steps: Prosumer Intelligence Toolkit

With the first model, one can identify works that are very likely to become popular and could therefore be, for example, good candidates for publishing. With the second model, one can identify trending topics, i.e. keywords, categories, terms, topics, genres or subgenres that are growing in popularity and therefore represent promising fields for exploration.

This is one of several analyses being carried out on data from established fanfiction communities. The next step is to develop a Prosumer Intelligence Toolkit with interactive dashboards, to show the potential of these kinds of metrics to extract actionable knowledge and foster cooperation between prosumers and the publishing sector.

This work was carried out by the Computational Social Science team at Eurecat, whose work focuses on studying social interactions and collective behaviour on online platforms.



This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement no. 957185