top of page
1646141194939.jpg

Caterina Bonan

Incoming Services Data Scientist @ Five9

Current residence:

Cambridge, UK.

Settlement status:

Pre-settled.

  • LinkedIn
  • GitHub
  • Wordpress
  • Gmail
  • WhatsApp

A Bit About Me

I am an expert in syntactic theory, language documentation and data modelling, currently employed as a postdoctoral researcher at the University of Cambridge.

I am fond of Deep NLP techniques for human-computer interaction purposes. A big advocate for OpenSource, I love collaborating on translation projects to make NLP and ML accessible to everyone (most recently, the 🤗 course) and the development of big language models (at present, polyglot).

Since June, I have been a Data and AI trainee at AiCore, where I am specialising in Machine Learning and Data Engineering. Lately, I have taken part in numerous projects ranging from image classification and webscraping, the creation of conversational chatbots and recommendation systems, and the development of a corpus of spoken interactions in Romance (pypelet).

Recent work history

May 2022 - present

September 2019 - present

Ancora 1

September 2015 - August 2019

MACHINE LEARNING AND DATA ENGINEERING TRAINEE

AiCore (London)

Training in AI & Data using industry-standard tools. Working on industry projects throughout (cf. Experience Through Projects).

​

Skills: Software engineering (Git & GitHub, advanced Python, algorithms & data structures); Data engineering (SQL, data lakes, data warehousing, web scraping); Cloud Engineering (cloud computing, designing/building APIs, Docker, Apache Airflow, AWS Serverless Stack).

POSTDOCTORAL RESEARCHER IN THEORETICAL LINGUISTICS

University of Cambridge (UK)

Worked on 30+ varieties throughout. Published 8 peer-reviewed research articles, 2 edited volumes and 1 research monograph. Awarded a SNSF mobility scholarship twice.
 

Tools: Excel, Python, MatPlotLib, Selenium, LaTeX, VS code, Word and Power Point.

Links to scientific writing samples: Cadernos, IOS Press, Glossa, Isogloss.

RESEARCH ASSISTANT IN THEORETICAL LINGUISTICS

University of Geneva (Switzerland)

Fully-funded doctoral research as part of a SNSF lab. Worked on standard and non-standard French, and 50+ northern Italian varieties. Published 3 research articles, 1 co-edited volume, and a PhD dissertation.

 

Tools: Excel, Python, Praat, LaTeX, Word and Power Point.

Education

September 2015 - August 2019

December 2015 - September 2017

September 2011 - June 2013

September 2008 - August 2011

PHD IN GENERAL LINGUISTICS (dissertation).

University of Geneva (Switzerland)

CERTIFICAT DE SPECIALISATION EN LINGUISTIQUE

University of Geneva (Switzerland)

MA IN THEORETICAL LINGUISTICS

Università Ca' Foscari Venezia (Italy)

BA IN SCIENCE OF LANGUAGE

UniversitÀ Ca' Foscari Venezia (Italy)

Skills

Linguistics

Syntax

International Phonetic Alphabet

Corpus linguistics

Romance languages

Language documentation

Natural Language Processing

Deep NLP

Research

Quantitative and qualitative methods

Data visualisation

Data analysis

Storytelling

Scientific writing

Public speaking

Programming

Python (proficient)

Java (basic)

LaTeX (intermediate)

Libraries & tools

Selenium

pandas

NLTK

pytorch

MatPlotLib

spaCy

Languages

Italian (native)

French (late bilingual)

English (proficient)

Spanish (elementary)

Experience through projects

See Portfolio page.

​

    bottom of page