Every day, insurance companies collect an enormous quantity of text data from multiple sources. We present a strategy to make beneficial use of the large amount of information available in documents by exploiting natural language processing. After a brief review of the basics of text mining, we describe a case study in which, by analyzing the accident narratives written by the researchers of the National Highway Traffic Safety Administration of the U.S. Department of Transportation, we aim to extract latent information that can be used to fine-tune policy premiums. The process involves two steps. First, we classify the reports according to the relevance of their content to determine the risk profiles of the people involved. Next, we use these profiles to create new latent risk covariates for a company’s ratemaking process.

Zappa, D., Borrelli, M., Clemente, G. P., Savelli, N., Text Mining in Insurance: From Unstructured Data to Meaning, <<VARIANCE>>, 2021; 2021 (1): 1-15 [https://hdl.handle.net/10807/224967]

Text Mining in Insurance: From Unstructured Data to Meaning

Zappa, Diego
Primo
;
Clemente, Gian Paolo
Penultimo
;
Savelli, Nino
Ultimo
2021

Abstract

Every day, insurance companies collect an enormous quantity of text data from multiple sources. We present a strategy to make beneficial use of the large amount of information available in documents by exploiting natural language processing. After a brief review of the basics of text mining, we describe a case study in which, by analyzing the accident narratives written by the researchers of the National Highway Traffic Safety Administration of the U.S. Department of Transportation, we aim to extract latent information that can be used to fine-tune policy premiums. The process involves two steps. First, we classify the reports according to the relevance of their content to determine the risk profiles of the people involved. Next, we use these profiles to create new latent risk covariates for a company’s ratemaking process.
2021
Inglese
Zappa, D., Borrelli, M., Clemente, G. P., Savelli, N., Text Mining in Insurance: From Unstructured Data to Meaning, <<VARIANCE>>, 2021; 2021 (1): 1-15 [https://hdl.handle.net/10807/224967]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/224967
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact