IRIS UniCatt

This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.

Sprugnoli, R., Caselli, T., Tonelli, S., Moretti, G., The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts, in Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, (Valencia (Spagna), 03-07 April 2017), Association for Computational Linguistics, Valencia (Spagna) 2017:2 260-266. [10.18653/v1/e17-2042] [http://hdl.handle.net/10807/132848]

The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts

Sprugnoli, Rachele^Primo;Tommaso Caselli^Secondo;Sara Tonelli^Penultimo;Giovanni Moretti^Ultimo

2017

Abstract

This paper presents a new resource, called Content Types Dataset, to promote the analysis of texts as a composition of units with specific semantic and functional roles. By developing this dataset, we also introduce a new NLP task for the automatic classification of Content Types. The annotation scheme and the dataset are described together with two sets of classification experiments.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno pubblicazione
	
				2017
			
	Lingua del contenuto
	
				Inglese
			
	Titolo del volume che raccoglie gli atti
	
				Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers
			
	Denominazione evento
	
				15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017
			
	Luogo dell'evento
	
				Valencia (Spagna)
			
	Data inizio evento
	
				3-apr-2017
			
	Data fine evento
	
				7-apr-2017
			
	ISBN del volume
	
				978-1-945626-34-0
			
	Editore
	
				Association for Computational Linguistics
			
	DOI del contributo
	
				https://dx.doi.org/10.18653/v1/e17-2042
			
	URL alternativo
	
				https://www.aclweb.org/anthology/E17-2042
https://aclweb.org/anthology/papers/E/E17/E17-2042/
			
	Citazione
	
				Sprugnoli, R., Caselli, T., Tonelli, S., Moretti, G.,  The Content Types Dataset: a New Resource to Explore Semantic and Functional Characteristics of Texts, in Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, (Valencia (Spagna),  03-07 April 2017), Association for Computational Linguistics, Valencia (Spagna) 2017:2 260-266. [10.18653/v1/e17-2042] [http://hdl.handle.net/10807/132848]
			
	Appare nelle tipologie:
	
				Atti di Convegno, Congresso, Giornate di studio, ecc., Workshop (in volume)

File in questo prodotto:

File	Dimensione	Formato
E17-2042.pdf accesso aperto Tipologia file ?: Versione Editoriale (PDF) Licenza: Creative commons Dimensione 115.74 kB Formato Adobe PDF Visualizza/Apri	115.74 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/132848

Citazioni

ND

5

ND

social impact