Fake News and Covid-19
or what effect Corona has on my thesis ... (03.05.2020)
Funding year 2019 / Scholarship Call #14 / Project ID: 4563 / Project: Automated Identification of Information Disorder in Social Media from Multimodal Data

Covid-19 has had a significant effect on my master's thesis. On the one hand, the crisis brought our daily lives to a halt; on the other hand, everyone expects you to keep working as normal. A status update...

A lot has happened since the last blog entry in March 2020, not only to me but to all of humanity: Covid-19. It brought our daily lives to a dramatic halt and forced us to change our daily work routines. It changed our way of thinking and moved most of us into home office.

As the head of the students' union of the UAS Sankt Pölten, I am still finding this a very hard time. My team and I invested a lot of time in information campaigns and in managing the workflows together with the UAS services. I am very glad to study and work at the UAS Sankt Pölten because, despite the huge time investment involved, most of our lectures and exams are now online.

I finally decided to focus on the Fakeddit fake news dataset: https://github.com/entitize/Fakeddit. You can find the paper here: https://arxiv.org/abs/1911.03854

It consists of around 1 million Reddit posts and comments and a really huge amount of image data, which took several days to download.

The drawback was that a new version of the dataset was released a few weeks after I had downloaded it and started applying my method to it.

So I have decided to go into more detail in my work and to move my deadline from May to August.

 

So enough of the bad news, on to the good news:

Since the dataset contains text (titles and comments), images (around 60 %) and some meta-data (authors, scores), I decided to try a multimodal approach using:

  1. Text data
  2. Image data
  3. Social meta-data

In the last few weeks I got used to a lot of preprocessing methods. I managed to write a working model that processes text and image data simultaneously in an end-to-end manner. Currently I am investigating methods for normalizing image data and improving the classification part.
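To give a rough idea of what combining the modalities can look like, here is a minimal sketch in plain Python. It is not the actual thesis model: the function names `min_max_normalize` and `fuse_features`, the simple division by 255 and the zero-vector-plus-flag handling of posts without an image are all assumptions made for illustration only.

```python
from typing import List, Optional, Sequence


def min_max_normalize(pixels: Sequence[int]) -> List[float]:
    """Scale raw 8-bit pixel values (0..255) into the range [0, 1].

    One simple normalization option, chosen here for illustration.
    """
    return [p / 255.0 for p in pixels]


def fuse_features(
    text_vec: Sequence[float],
    image_vec: Optional[Sequence[float]],
    meta_vec: Sequence[float],
    image_dim: int,
) -> List[float]:
    """Concatenate per-modality feature vectors into one input vector.

    Assumption: a post without an image is represented by a zero vector
    of length image_dim plus a flag (0.0 = no image, 1.0 = image), so
    that a classifier can tell "no image" apart from "all-zero features".
    """
    if image_vec is None:
        image_part = [0.0] * image_dim
        has_image = 0.0
    else:
        if len(image_vec) != image_dim:
            raise ValueError("image_vec must have length image_dim")
        image_part = list(image_vec)
        has_image = 1.0
    # Layout: text features, image features, image flag, meta features
    return list(text_vec) + image_part + [has_image] + list(meta_vec)


# Hypothetical toy values: 2 text features, no image, 1 meta feature
fused = fuse_features([0.1, 0.2], None, [3.0], image_dim=3)
print(fused)
```

In a real end-to-end network the text and image vectors would come from learned encoders rather than being passed in by hand; the sketch only shows the fusion step and one way to cope with posts that have no image.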

The next few weeks are very important for me. I want to include features from the social meta-data provided by the authors, tune my network to better fit my problem, and start systematic experiments.

Stay tuned and healthy ;)

 

 

Image: Designed by starline / Freepik
