Tuesday, September 29 • 14:00 - 15:15
Lessons learned extracting data from documents



Some of the most interesting datasets started life 'unstructured' -- as documents, emails, web pages, images, videos, and other formats that look nothing like a spreadsheet. This talk will cover the challenges of extracting data from these formats, the tools available, and approaches for verifying the results. No prior technical knowledge required.

Moderators
Adriana Homolova

ARENA / Follow The Money, Austria / Slovakia
Adriana is a freelance data journalist, trainer and public spending nerd. She coordinates the data skills training track at the Dataharvest conference and investigates the European Union for Follow The Money Bureau Brussel.

Speakers
Max Harlow

Financial Times
Max Harlow works on the visual and data journalism team at the Financial Times, focusing on investigations. He also runs Journocoders, a group for journalists to develop technical skills for use in their reporting.


Tuesday September 29, 2020 14:00 - 15:15 CEST
TBA