Welcome to inspiration, learning, instruction, networking and collaboration! Buy your ticket here!
Back To Schedule
Tuesday, September 29 • 14:00 - 15:15
Lessons learned extracting data from documents

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Some of the most interesting datasets started life 'unstructured' -- as documents, emails, web pages, images, videos, and other formats that look nothing like a spreadsheet. This talk will cover the challenges in extracting data from these formats, what tools are available, and approaches for verifying the results. No existing technical knowledge required.

avatar for Adriana Homolova

Adriana Homolova

Proud coordinator of the DH data training 📊, Freelance data journalist

avatar for Max Harlow

Max Harlow

Financial Times
Max Harlow works with the visual and data journalism team at the Financial Times in London. He has previously worked on investigations at the Guardian and at the Bureau of Investigative Journalism. He co-runs Journocoders, a group for journalists who want to develop technical ski... Read More →

Tuesday September 29, 2020 14:00 - 15:15 CEST