For reporters satellite data offers unique opportunities for original investigations and visual storytelling. But how do you get started? And what should you be looking out for? In a guest post for the Online Journalism Blog, MA Data Journalism student Niels de Hoogspeaks to four journalists who regularly work with satellite data about how to start, best practices and —most importantly — mistakes to avoid.
While researching my post on developing curiosity in journalism I came across Terry Heick‘s 4 stages of curiosity. It outlines 4 steps that learners go through as they grapple with new knowledge: firstly finding out what they are expected to do (the process); then understanding the content involved; then how to transfer that to particular situations; and finally how it applies to, and changes, them.
But the same model can also be adapted to provide a framework for investigations. Here’s how:
In an extract from a new chapter in the ebook Finding Stories in Spreadsheets, I explain what regular expressions are — and how they can be used to extract information from spreadsheets.The ebook version of this tutorial includes a dataset and exercise to employ these techniques.
The story was an unusual one: the BBC Data Unit had been given access to a dataset on more than 200,000 works of art in galleries across the UK. What patterns could we find in the data that would allow us to tell a story about the nature of the nation’s paintings?
Some of the data was straightforward to work with: the ‘artist’ column was relatively clean, and allowed us to identify the most common male and female artist. It turned out that the latter – the Victorian botanist Marianne North – was relatively unknown. So, that was one story we could tell.
But other parts of the data were more problematic. The date column, for example, contained inconsistently formatted data: in the majority of cases a specific year had been entered, but in many others the data contained text such as “18th century” or “1900-1920” or “1800s”.
We also noticed that monarchs featured heavily in the art – but understandably there was no column that was specifically dedicated to classifying those. If we wanted to identify the most-painted monarchs we would have to create new data that somehow extracted those names from the paintings’ titles.
These problems – extracting data from existing data, particular text data – are what regular expressions are designed for. In this chapter I will explain what regular expressions are, and how to use them in spreadsheets.
Nas minhas aulas e treinamentos de jornalismo de dados, costumo falar sobre os tipos mais comuns de histórias que podem ser encontradas em bancos de dados. Então, selecionei 100 reportagens baseadas em dados, analisei-as e verifiquei com qual frequência cada um desses ângulos é utilizado.
Cheguei à conclusão de que, na verdade, existem sete ângulos principais para reportagens e histórias baseadas em dados. Muitas histórias incorporam outros ângulos como dimensões secundárias da narrativa (uma história de mudança pode passar a falar sobre a escala de algo, por exemplo), mas todas as histórias de jornalismo de dados que examinei levaram um desses ângulos como fio-condutor.
Neste post, examino como os sete ângulos mais comuns podem ajudar você a ter ideias para histórias e reportagens, assim como a variedade de execuções e as principais considerações para se ter em mente.
Government says journalist “extracted data improperly” — but the journalist affirms that he only used a browser’s Inspect Element tool, reports Beatriz Farrugia.
Data journalism has been at the centre of a political debate in Brazil for two weeks after President Jair Bolsonaro’s government made allegations against a data journalist — for extracting data from a web app developed by the Brazilian Ministry of Health to prescribe treatments against COVID-19.
The TrateCov app was launched in January 2021 for Brazilian doctors. Professionals were told they would be able to enter a patient’s profile and symptoms into the app, which would then suggest medication.
However, the data journalist Rodrigo Menegat analyzed the app’s source code and found that, regardless of the patient’s symptoms, age and health conditions, TrateCov indicated the use of chloroquine, hydroxychloroquine and ivermectin — drugs with no scientific evidence supporting their use in the treatment of coronavirus.
He announced his discovery on 20 January in a series of tweets. “Guys,” he wrote:
“I just put in the TrateCov app that my patient is a one week-old newborn who has a stomach ache and a runny nose. The app recommended chloroquine, ivermectin, azithromycin and everything else. Crime, crime, crime, crime.”
Other journalists and broadcasters tested the app and came to the same conclusion.
Soon after the complaints, the app was removed by the Brazilian Government.
Accused of committing cyber crime
Then on May 25th, during a public session of a parliamentary inquiry, Menegat was accused of having committed cyber crime by an official of the Brazilian Ministry of Health: Mayra Pinheiro.
The parliamentary inquiry, opened late last month, is investigating the Bolsonaro government’s response to the pandemic. More than 461,000 people have died in Brazil so far.
Approved by Brazil’s Supreme Court, the inquiry is pursuing multiple lines of investigation, such as why the Brazilian government promoted ineffective treatments and why three health ministers were removed over the pandemic.
Naming the data journalist, Pinheiro said Menegat performed an “improper data extraction”.
“He was unable to hack,” said Mayra. “He did an improper data extraction. Hacking is when you use someone’s password, enter a platform, a system. The term is not hacking. Today we have the official report that classifies it as improper data extraction.
“He did improper simulations. [The system] was taken down for investigation.”
In another testimony session to the parliamentary inquiry the previous week the former Health Minister General Eduardo Pazuello said that the app had been “stolen and hacked by a citizen”.
After the allegations the data journalist explained that he had only used the browser’s Inspect Element tool to analyse the source code.
“As a data journalist and developer, I only analyzed the source code which was public and available on the website of the TrateCov app, saved on a government server (https://tratecov.saude.gov.br) and accessible to any internet user curious enough to do this verification on their own.”
“The procedure has in no way altered any content on the platform”, he added.
Since the allegations Menegat has limited his social media accounts to avoid online attacks by government supporters.
“I am closing my Twitter account for more than an obvious reason, but I will be very pleased to show who wants to know how to use the Element Inspector to access source code from any website in the world,” wrote the journalist.
Other Brazilian data journalists showed support for Menegat and published content explaining the technique used to analyse the app.
Eariler this week I came across a fantastic example of using company accounts in journalism: Guardian media editor Jim Waterson‘s article on how the family that founded the sex-worker social media platform OnlyFans “extracted tens of millions of pounds from its parent company in the last year”.
THREAD! I've reverse-engineered @jimwaterson's excellent article on sex-worker social media platform OnlyFans to show you how to find all sorts of stories in company accounts. Here's the article – https://t.co/mnkRYcyAru – now let's begin…🧵 #bcujournos
Journalism students now need training for 'vile' online abuse they will face, study finds + tips for dealing with ugly social media https://t.co/lXn3bQKJOE
Online abuse of journalists has become so routine that a study this week recommends that journalism students need training to prepare them for it.
But how do you incorporate online abuse into journalism curricula? Over the past year I’ve been trying to do just that — here are some ideas.
1. Ask about online abuse on news days
Most journalism courses already have news days, so considering online abuse in news meetings is a simple way to incorporate the topic into teaching without having to create new classes or materials.
“The objective is to normalize discussions about online harassment in the newsroom,” they say. “An important step toward a creating a culture of safety. Journalists should feel comfortable coming forward with their experiences and concerns.”
This is especially important in election weeks: the IPI suggest that discussions about online abuse should be more frequent during politically tense periods or ahead of elections.
If the outlet you use for student journalism has its own style guide or guidelines, you can also add a section about online abuse that outlines what they should do when it happens, and resources they can draw on, such as this Trollbusters infographic on actions to take regarding a range of threats, and those listed below.
2. Include online abuse in risk assessments
The section on psychological safety from the CPJ risk assessment template
Journalists and journalism students are often required to complete risk assessments ahead of certain newsgathering activity, such as filming on location.
Teaching on risk assessment can include these dimensions and point to those resources. It might be explained that risk factors include the nature of the story being covered: those relating to gender, ethnicity, immigration, extremism, disinformation, and other topics where there are strong feelings are likely to be higher risk than others.
This is also a good opportunity to talk about the risk factors related to the journalist’s own gender, ethnicity and sexuality, and how those aren’t limited to the online world: 73% of UNESCO research (PDF) respondents identifying as women said they had experienced online violence, and one in five said that abuse had moved offline.
3. Cover both sides of harassment in media law…
Harassment laws (or dimensions of industry codes) have become increasingly used against journalists — but journalists are also increasingly having to report harassment under the same laws. Media law classes on privacy, harassment, and malicious communications can address the law both as something to consider when reporting, and something that the reporter and their employer can draw on themselves.
In addition to the law, social media companies themselves are increasingly policing content, a form of regulation which both journalists and their audiences are now subject to. Understanding how that works — and how and when online abuse can be reported on different platforms — is an important new dimension to media law and regulation.
4. …and information security, too
Many strategies of online abuse mean the target needs to consider information security
In the media law module on my MA journalism courses I teach information security. Why? Journalists, as I concluded in research in 2015, can no longer protect sources through legal channels alone, and therefore need to understand both technological and legal defences.
This year I extended that teaching to talk about personal information security in the context of online abuse as well.
PEN America‘s Online Harassment Field Manual compiles “strategies to protect and arm yourself before you become the target of abuse, with an emphasis on tightening your cybersecurity and establishing supportive online communities who will have your back”, addressing a number of tactics used by online abusers, from doxxing and impersonation to hacking. Feminist Frequency also provides a guide to online safety.
The Data Detox Kit also outlines “steps you can take to control your digital privacy, security, and wellbeing in ways that feel right to you.”
5. Include guidelines in classes on writing for social media
Social media is the main space where online abuse takes place, so classes on writing for those platforms, and related skills such as community management (the IPI has a whole section on audience moderation), are an obvious place to address some of the techniques for dealing with online abuse.
One of those techniques, for example, is to “flood social media with positive posts about the work of colleagues to drown out any abuse they may be facing”.
“Keep work and personal social media accounts separate
“Use strict settings to filter out trolls — block, ignore and mute personal attacks
“Switch off outside office hours
“If the accuracy of a story is being called into question and you choose to respond, remain factual — but don’t expect to have the last word, as you can never win a ‘Twitter spat’
“Differentiate between attacks (personal) and criticism (on journalistic grounds)
“Document and threats or abuse
“Report abuse to management and use internal processes that are in place
“Speak about it with family, friends or colleagues to “take a bit of the heat out”
“Do things you enjoy outside work
“Remember abuse is never about your ability
“Take time away from social media if necessary and ask for an editor’s support to do so
“Know it is okay to be upset and that abuse is not acceptable and should not be part of the job.”
The International Women’s Media Federation (IWMF) has a Know your trolls course which can also be incorporated into social media training.
6. Organise a screening of A Dark Place
Earlier this month I organised a screening and Q&A around the one-hour documentary A Dark Place, which highlights “the experiences of female journalists who have been targeted by online harassment”.
It’s a must-see documentary, not least in understanding the gendered dimension of online violence, as outlined in a recent UNESCO report:
“Women journalists are both the primary targets of online violence and the first responders to it.
“Misogyny is one of the key features of online violence targeting women journalists, and it has been routinised … In detail and delivery, the threats are personal [and] they are often highly sexualised.”
7. Ask guest speakers about how they handle online abuse
There is a good chance that a planned guest speaker will have had to deal with some form of online abuse themselves. If you feel that this might be relevant or valuable to students it may be worth asking the speaker in advance if this is the case and if they are willing to talk about that as part of the session.
Hearing respected industry figures talk about online abuse makes the topic more concrete and manageable, while helping students to put it into the context of industry practice, support systems, and ongoing developments.
8. Include online abuse as a critical issue in academic classes
Most journalism courses require students to study the critical issues surrounding the profession. Online abuse — either on its own or alongside related issues such as misinformation, privacy, social media, diversity and/or propaganda — is a strong candidate to be included as one of those. Resources you can draw on and point students to include:
The latest frequently asked questions post focuses on questions from a Masters student interested in the effect of the rise of online news on journalism ethics.
Do you think that the ethical codes of journalism have changed in the transition from traditional journalism to digital?
I think the ethics of journalism have changed, yes, for a range of reasons, and in both negative and positive ways. For example, transparency has become much more highly valued as a journalistic value: journalists are expected to earn the trust of readers much more than was previously the case, and I would argue that is a positive development. Linking to sources, sharing methodologies, etc. forces journalists to hold themselves to higher standards.
Continue reading →
Al Jazeera’s interactive team AJ Labs have a mantra: “human driven data journalism”. In a guest post for OJB Hanna Duggal speaks to the team’s lead Mohammed Haddad on what this means and how he tackles big data, including a recent story commemorating the Arab Spring.
And, while many of the issues Al Jazeera covers are deeply complex, AJ Labs often help to explain such narratives using data journalism. Continue reading →
In a guest post for OJB,George Willoughby spoke to data journalist Rodrigo Menegat about reporting on Covid-19 in Brazil, managing uncertainty and how data journalism could help debunk misinformation.
At the height of the first wave of the coronavirus pandemic in March, data on the disease was in high demand. It required collaboration — something made more difficult with data lacking in quality.
Having spent most of his career covering politics, last year Rodrigo Menegat realised that science data — particularly Covid-19 data — was fast becoming a staple in the newsroom.
“The first challenge was learning how to cover data which is very different to sport or politics,” he says.
The difficulty was understanding something that, as a country, Brazil was not ready to face. Continue reading →