Code-mixing is the alternation between two or more languages in the same text. This phenomenon is highly relevant in the travel domain, since it can provide new insight into the way foreign cultures are perceived and described to readers. In this paper, we analyse English-Italian code-mixing in historical English travel writings about Italy. We retrain and compare two existing systems for the automatic detection of code-mixing, and analyse the semantic categories most often connected to Italian. We also release the domain corpus used in our experiments and the output of the extraction.

Sprugnoli, R., Tonelli, S., Moretti, G., Menini, S., A little bit of bella pianura: Detecting Code-Mixing in Historical English Travel Writing, in Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017), (Roma, 11-13 December 2017), Accademia University Press, Torino, Italy 2017:2006 304-309. [10.4000/books.aaccademia.2469] [http://hdl.handle.net/10807/132955]

A little bit of bella pianura: Detecting Code-Mixing in Historical English Travel Writing

Sprugnoli, Rachele (first author)
2017

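Year: 2017
Language: English
Proceedings title: Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017)
Conference: CLiC-it 2017
Location: Roma
Start date: 11 December 2017
End date: 13 December 2017
ISBN: 978-88-99982-76-8
Publisher: Accademia University Press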

Use this identifier to cite or link to this document: https://hdl.handle.net/10807/132955