DH Seminar "Literary genre classification with syntactic and lexical features"

Monday, 24 March, 2014 - 15:00 to 16:00

Speaker: Andreas van Cranenburgh, PhD student, UvA Amsterdam

Title: Literary genre classification with syntactic and lexical features


In the context of the project "The Riddle of Literary Quality", we aim to find correlations between reader evaluations of novels and textual features. The goal is to describe what makes readers consider a text literary and/or good. The reader evaluations were collected in a large survey; however, the results have not yet been analyzed.
Therefore, in this talk I will use genre as a proxy for literary quality. I will present some results on classifying texts by genre with support vector machines based on lexical and syntactic features.

