Sara Tonelli
Head of unit
- E-mail: satonelli@fbk.eu
- Phone: +39 0461 314542
- Google Scholar: My citations
- Semantic Scholar: Profile
- DBLP: Profile
- Twitter: Profile
Short bio
I have been the head of the Digital Humanities research group at FBK since 2013. I am currently involved in the H2020 ODEUROPA project, where I lead the work package on olfactory information extraction. I am also part of several other interesting European projects, for example PROTECTOR, STAND BY ME and SCAN2; more details here. I am also involved in the HYBRIDS MSCA network, whose goal is to fight disinformation using AI and human-in-the-loop approaches. Between January 2021 and December 2022 I was the scientific coordinator of the KID ACTIONS European project, aimed at addressing cyberbullying among children and adolescents through interactive education and gamification.
I have a PhD in Language Sciences from Università Ca' Foscari, Venice. In 2020 I obtained the national habilitation as Associate Professor (seconda fascia) for the area 'Information Systems' 09/H1. I am also a member of ELLIS, the European Laboratory for Learning and Intelligent Systems, and an appointed Honorary Fellow (cultore della materia) in Computational Linguistics L-LIN/01 at Università di Pavia, Italy. I am currently serving as Liaison Representative of the ACL Special Interest Group on Language Technologies for the Socio-Economic Sciences and Humanities (SIGHUM), and I am part of the board of the Italian Association for Computational Linguistics (AILC). I am also a member of the External Advisory Board of the LT-Bridge project.
In the past I was involved in several European projects: Pescado (FP7 - keyword extraction), Terence (FP7 - event-based text simplification), NewsReader (FP7 - event extraction and semantic role recognition), SIMPATICO (H2020 - text simplification in the administrative domain), HATEMETER (REC - social media monitoring for Islamophobia detection), CREEP (EIT Digital - cyberbullying detection).
In 2022, I was area co-chair for "Digital Humanities and Cultural Heritage" at LREC and area chair for the "Offensive and Non Inclusive Language Detection and Analysis" track at COLING. I am currently diversity and inclusion co-chair at EACL 2023.
Activities
Students' supervision:
Current PhD students:
2022 - present: Nicolò Penzo (co-advised with Bruno Lepri and Marco Guerini), "Analysing the effect of counter-narratives on hateful conversations online", ICT Doctoral School, University of Trento
2021 - present: Teresa Paccosi, "Extraction of Olfactory Information from Texts", Doctoral School in Cognitive Science, University of Trento
2020 - present: Camilla Casula, "Multilingual abusive language detection", ICT Doctoral School, University of Trento
2019 - present: Daniela Trotta (co-advised with Annibale Elia), "Multimodal political communication", Dept. of Linguistics, University of Salerno
Past PhD students:
2018 - 2022: Federico Bonetti, "Gamification for Linguistic Annotation", Doctoral School in Cognitive Science, University of Trento
2017 - 2022: Matteo Lorenzini (co-advised with Marco Rospocher), "Automatic quality improvement and content enrichment of digital cultural heritage data", ICT Doctoral School, University of Trento
2016 - 2020: Lorenzo Lucchini (co-advised with Bruno Lepri), "Modeling and forecasting cultural dynamics with natural language processed data", ICT Doctoral School, University of Trento
2013 - 2018: Stefano Menini, "Automatic Analysis of Agreement and Disagreement in the Political Domain", ICT Doctoral School, University of Trento
2013 - 2018: Rachele Sprugnoli, "Event detection and classification for the Digital Humanities", ICT Doctoral School, University of Trento
2012 - 2016: Paramita Mirza, "Extracting temporal and causal relations between events", ICT Doctoral School, University of Trento [arXiv]
Research topics
Social media analysis and Hate speech detection: CREEP and Hatemeter projects, KID ACTIONS project, PROTECTOR project, PERCEPTIONS project.
Event and temporal/causal processing: Terence and NewsReader European projects, Rachele Sprugnoli's thesis, Paramita Mirza's thesis [link to software, poster, download CausalTimeBank]
Agreement, Disagreement and Argumentation Mining, especially in the political domain: collaboration with INRIA & University of Nice, Stefano Menini's thesis [link to data, link to datasets], collaboration with University of Mannheim (honorable mention at EMNLP 2017 for joint paper)
Digital Humanities, Historical Data Processing, Digital Cultural Heritage: ALCIDE project [link to demo], project on Epistolario De Gasperi and Edizione Nazionale Aldo Moro, Matteo Lorenzini's thesis work. Current ODEUROPA project.
Text Simplification: Terence and SIMPATICO European projects, MUSST syntactic simplifier [link to software], ERNESTA simplification tool, SIMPITIKI corpus for Italian Simplification [link to dataset]
FrameNet, Keyword Extraction and Terminology: my PhD thesis work [link to Italian FrameNet data], collaboration with Luciano Serafini to create a FrameNet-based resource for the Semantic Web [link to data], Keyphrase Digger extraction tool [link to demo and software].