profile picture

Lisa Hoek

Data Specialist

Who am I

Hi, I'm Lisa Hoek, a passionate and experienced Data Scientist, working from my campervan while exploring the beautiful countrysides of Europe.

With a deep-seated expertise in Python and many of its packages (e.g., pandas, matplotlib, sklearn), I thrive on transforming complex data into clear results. I specialize in machine learning, AI, Large Language Models, and Natural Language Processing (NLP), and have a particular affinity for working with both text and image data.

Part of my latest work revolves around transcribing 20th century Surinamese and Curaçaoan death certificates using Transkribus and GPT. My toolkit also includes robust skills in regular expressions, Named Entity Recognition and Linked Open Data.

I am also a prompt engineer adept at leveraging OpenAI's GPT models to craft intelligent and responsive applications. Having completed my Master's degree in Data Science enables me to adapt swiftly to new challenges and technologies.

Expertise

Prompt Engineering (LLMs)

Natural Language Processing

Text Recognition (Transkribus)

Machine Learning

Data Analysis

Computer Vision

Linked Open Data

Portfolio

View more
portfolio page smart rules

Smart Rule implementation using SpaCy

View more
portfolio page transcribing surinamese death certificates

Transcribing 20th century Surinamese death certificates using Transkribus and GPT

View more
portfolio page magic pen pal prompt revision

GPT Prompt revision for kids' magic pen pal

View more
portfolio page age discrimination in job advertisements

Detecting Age Discrimination in Job Advertisements using GPT and BERT

View more
portfolio page transcribing curaçaoan death certificates

Transcribing 19-20th century Curaçaoan death certificates using Transkribus and GPT

View more
portfolio page master thesis extracting entities from handwritten civil records using htr and regexes

Extracting Entities from Handwritten Civil Records using HTR and RegExes (MSc thesis)

View more
portfolio page pull factor regexes in job advertisements

Pull-Factor Regular Expressions in Job Advertisements

View more
portfolio page named entities in job advertisements

Named Entities in Job Advertisements
(Internship Data Science)

portfolio page bachelor thesis building a web classification system using dmoz

Building a web classification system using DMOZ (BSc thesis)

What our colleagues are saying

Get in Touch

I'm currently available for new projects, feel free to get in touch to chat.

I'd love to hear from you!

info@hookedondata.nl