IRIS UniCatt

Some like it Hoax: Automated fake news detection in social networks

Tacchini, Eugenio; Ballarin, Gabriele; Della Vedova, Marco Luigi; Moret, Stefano; de Alfaro, Luca

2017

Abstract

In recent years, the reliability of information on the Internet has emerged as a crucial issue in modern society. Social network sites (SNSs) have revolutionized the way in which information is spread by allowing users to freely share content. As a consequence, SNSs are also increasingly used as vectors for the diffusion of misinformation and hoaxes. The amount of disseminated information and the rapidity of its diffusion make it practically impossible to assess reliability in a timely manner, highlighting the need for automatic online hoax detection systems. As a contribution towards this objective, we show that Facebook posts can be classified with high accuracy as hoaxes or non-hoaxes on the basis of the users who “liked” them. We present two classification techniques, one based on logistic regression, the other on a novel adaptation of boolean crowdsourcing algorithms. On a dataset consisting of 15,500 Facebook posts and 909,236 users, we obtain classification accuracies exceeding 99% even when the training set contains less than 1% of the posts. We further show that our techniques are robust: they work even when we restrict our attention to the users who like both hoax and non-hoax posts. These results suggest that mapping the diffusion pattern of information can be a useful component of automatic hoax detection systems.
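The like-based classification idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each post is represented simply by the set of users who liked it (one weight per user), trains a plain logistic regression by stochastic gradient descent, and uses a tiny made-up dataset. The names `train_logistic`, `predict`, `likes`, and `labels` are hypothetical and chosen only for this illustration.

```python
import math

def train_logistic(likes, labels, n_users, epochs=200, lr=0.5):
    """Fit one weight per user plus a bias with stochastic gradient descent.

    likes  : list of sets of user indices, one set per post (assumed encoding)
    labels : list of 1 (hoax) or 0 (non-hoax), one per post
    """
    w = [0.0] * n_users
    b = 0.0
    for _ in range(epochs):
        for users, y in zip(likes, labels):
            z = b + sum(w[u] for u in users)   # score from the users who liked the post
            p = 1.0 / (1.0 + math.exp(-z))     # predicted probability of "hoax"
            g = p - y                          # gradient of the log loss w.r.t. z
            for u in users:
                w[u] -= lr * g
            b -= lr * g
    return w, b

def predict(users, w, b):
    """Return 1 (hoax) if the linear score is positive, else 0 (non-hoax)."""
    z = b + sum(w[u] for u in users)
    return 1 if z > 0 else 0

# Toy data, invented for illustration only: users 0-2 like hoax posts,
# users 3-5 like non-hoax posts.
likes = [{0, 1}, {1, 2}, {3, 4}, {4, 5}]
labels = [1, 1, 0, 0]

w, b = train_logistic(likes, labels, n_users=6)

# Classify two unseen posts from the users who liked them.
print(predict({0, 2}, w, b))
print(predict({3, 5}, w, b))
```

On data of the scale reported in the abstract, a sparse matrix representation and a library solver would be the more natural choice; the pure-Python loop here is only meant to make the per-user weighting visible.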

Publication year	2017
Content language	English
Proceedings volume title	CEUR Workshop Proceedings
Event name	2nd Workshop on Data Science for Social Good, SoGood 2017
Event location	Skopje
Contribution type	Paper
Event start date	18 Sep 2017
Event end date	18 Sep 2017
Publisher	CEUR-WS
Alternative URL	http://ceur-ws.org/
Citation	Tacchini, E., Ballarin, G., Della Vedova, M. L., Moret, S., De Alfaro, L., Some like it Hoax: Automated fake news detection in social networks, Paper, in CEUR Workshop Proceedings, (Skopje, 18 September 2017), CEUR-WS, AACHEN -- DEU 2017: 1-15 [http://hdl.handle.net/10807/116519]
Appears in types	Paper, Selected paper, Contributed paper, Working paper, Poster, Poster paper, Communication, Report (in volume)

Files in this item:

File	Size	Format
paper2.pdf (open access; file type: postprint, the author's final version after peer review; license: not specified)	446.31 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10807/116519
