Welcome to inspiration, learning, instruction, networking and collaboration! Buy your ticket here!
Back To Schedule
Thursday, September 10 • 14:00 - 15:15
Who said text isn't data? Text analysis for data journalists with R

Log in to save this to your schedule, view media, leave feedback and see who's attending!

Is text "data" too? In this 1 hour session, you will learn how to use TidyText package for R statistical programming language in the RStudio development environment and turn text into newsworthy insights for your story. You will learn how to import texts into the environment and perform basic analyses for word usage in documents, how to analyse and visualise text and learn the basics of sentiment analysis. Experience with R, RStudio, and the Tidyverse is recommended, but not mandatory.

This is a demonstration, so we won't be fixing your errors, but if you would like to follow along, you can follow these instructions for installing Rstudio

avatar for Adriana Homolová

Adriana Homolová

Proud coordinator of the data skills training 📊, ARENA, Follow The Money, Lost in Europe
Adriana is a freelance data journalist, trainer and public spending nerd. At Dataharvest, she coordinates the data skills training. At other times, she writes scrapers and investigates European Union for Follow The Money's Bureau Brussel and collects data on missing children in migration... Read More →

avatar for Rui Barros

Rui Barros

data journalist, Público
Portuguese data journalist currently working at Público. In a relationship with R, loves to build things on the web. The solo coder in his newsroom, dreaming about the day where he’ll be just one more on a data team, so he doesn't have to debug his code alone.

Thursday September 10, 2020 14:00 - 15:15 CEST