Every day insurance companies collect an enormous quantity of text data from multiple sources. By exploiting Natural Language Processing, we present a strategy to make beneficial use of the large information available in documents. After a brief review of the basics of text mining, we describe a case study where, by analyzing the accident narratives written by the researchers of the National Highway Traffic Safety Administration (NHTSA) of the U. S. Department of Transportation, we aim at grasping latent information useful to fine-tune policy premiums. The process is based on two steps. First, we classify the reports according to the relevance of their content to find the risk profile of the people involved. Next we use these profiles to add new latent risk covariates for the ratemaking process of the customers of a company.

Zappa, D., Clemente, G. P., Borrelli, M., Savelli, N., Text mining in insurance: from unstructured data to meaning, <<VARIANCE>>, 2021; 2021 (14): 1-15 [https://hdl.handle.net/10807/129515]

Text mining in insurance: from unstructured data to meaning

Zappa, Diego;Clemente, Gian Paolo;Savelli, Nino
2019

Abstract

Every day insurance companies collect an enormous quantity of text data from multiple sources. By exploiting Natural Language Processing, we present a strategy to make beneficial use of the large information available in documents. After a brief review of the basics of text mining, we describe a case study where, by analyzing the accident narratives written by the researchers of the National Highway Traffic Safety Administration (NHTSA) of the U. S. Department of Transportation, we aim at grasping latent information useful to fine-tune policy premiums. The process is based on two steps. First, we classify the reports according to the relevance of their content to find the risk profile of the people involved. Next we use these profiles to add new latent risk covariates for the ratemaking process of the customers of a company.
2019
Inglese
Zappa, D., Clemente, G. P., Borrelli, M., Savelli, N., Text mining in insurance: from unstructured data to meaning, <<VARIANCE>>, 2021; 2021 (14): 1-15 [https://hdl.handle.net/10807/129515]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10807/129515
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact